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(54) Thumbnail Image manipulation using aspect ratio zooming, compression and scaling 



(57) . Disclosed are method and apparatus for the 
manipulation of thumbnail images as used in image- 
based browsing file management systems. Disclosed 
are arrangements whereby zooming in and out of 
thumbnail Images can be performed without a continued 
need to decompress a true image thus providing for fast- 
er operation. Pixel interpolation and/or replication are 



used to generate intermediate images that are display 
to deliver to the user a perception of a transitory zoom 
yet are of sufficient detail to maintain user orientation. 
Aspect ratio zooming of thumbnail containment areas is 
also disclosed which facilitates ease of browsing. The 
compression of thumbnail type images using a discrete 
wavelet transform facilitates the fast zoom of thumbnails 
and their associated containment areas. 
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Description 

FIELD OF THE INVENTION 

s The present invention relates to the display of images on a display device and, in particular, to a fast process for 

zooming-in and out of images, to the layout and associated zooming of image thumbnails for image browsing, and to 
the encoding and scaling of image thumbnails for fast display of multiple images at varying resolutions in applications 
such as image browsing. 

10 BACKGROUND 

As the price of digital cameras continues to fall, and as personal computers become Increasingly more powerful, 
the number of digital images managed by home and small business computer users is set to increase dramatically in 
the very near future. Despite this, the need for software image browsers to move beyond the limitations of the current 
IS design conventions to support effective browsing of many thousands of images has received relatively little attention. 
Further, despite the large number of digital images commonly encountered by users, fixed size Image thumbnails or 
Image icons continue to be used as the basis for designing conventional image browsing applications. 

Existing image browsers are based on the concept of displaying fixed size image thumbnails to users. Image 
thumbnails are small copies of actual image files that the thumbnails represent. Typically, thumbnails are displayed at 
20 between 80 x 80 pixels and 1 20 x 1 20 pixels. This allows visual display screens to display between 9 and 25 thumbnails 
at any one time depending on the screen and thumbnail resolutions, and the interface design. Consequently, when 
browsing large numbers of image thumbnails, users are unable to see more than a small proportion of the images of 
interest and are typically forced to scroll through vast arrays of thumbnails. This results in significant browsing and 
orientation problems. 

2S The fixed resolution at which designers of existing image browsers choose to display thumbnails represents a 

trade off between the two primary uses of thumbnails in image browsers, those being browsing image content and 
assessing Image content. Browsing image content is defined here as briefly scanning or glancing at image thumbnails 
to gain an overview of the images available. Browsing content is limited by the number of thumbnails that can be 
displayed at once. Assessing image content is a closer examination of image thumbnails and the assessment of detail, 

30 and Is limited by the resolution of thumbnails that can be displayed. 

Conventional approaches to the use of thumbnails have a number of disadvantages including the display of only 
a portion of fixed size thumbnails In a display area not large enough to display all the thumbnails and the use of scroll 
bars. Both of these contribute to user disorientation, general difficulty of navigating an image space, and an Inconsistent 
spatial representation of the Image space, amongst other things. 

35 Besides the disadvantages of only displaying only a portion of fixed size thumbnails, using scroll bars making 

navigation of an image space difficult, and providing an inconsistent spatial representation of the image space, such 
fixed size thumbnails themselves can use large amounts of memory or alternatively take relatively long period to de- 
compress from compressed image storage. For example, a conventional method of thumbnail zooming-involves per- 
forming progressive decompressions of the thumbnail image until the desired Image Is obtained. This provides for a 

40 representative depiction of the zoom as seen by the user but involves performing a number of complex decompressions 
in succession. This has been found to consume valuable processor and operator time which is undesirable. 

Thus, a need clearly exists for a method of providing thumbnails that are not fixed in size, have reduced memory 
requirements, are capable of rapid scaling, and provide for improved browsing applications. 

^ SUMMARY OF THE INVENTION 

It is an object of the present invention to substantially overcome, or at least ameliorate, one or more problems 
associated with prior arrangements. 

In accordance with a first aspect of the present invention there is provided a method of zooming an image on a 
so display device, said method comprising the steps of: 

displaying a first image at a first predetermined size; 

stretching or shrinking the first image to one or more other predetermined sizes; 
displaying at least one of said stretched or shrunken first image; and 
S5 displaying a second image at a second predetermined size on the display device. 

Preferably, the one or more other predetermined sizes are intermediate a first and second predetermined size. In 
the event where the first image is being stretched, the second image is preferably a greater resolution version of the 
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hrst image and where the first image is being shrunic the second image is preferably a lower resolution version of the 
first image. Typically, the first and second image are stored in a memory storage device (eg.: CD-ROM hard drive 
floppy drive, RAM, ROM card) in a compressed representation, and preferably are a first and second resolution of the 
same image respectively. 

In accordance wAt\ a second aspect of the invention, there is provided a method of zooming thumbnails of digital 
images for use in image brow/sing, the method comprising the steps of: 

laying out the thumbnails in at least one containment area having the same aspect ratte as an available display 
area of a display device; and 

scaling the thumbnails contained in the at least one containment area to fit within the available area of the con- 
tainment area. 



Preferably, the method further comprises the step of increasing the scale of the thumbnails if increased detail of 
the thumbnails is required for assessment. More preferably, the thumbnails are scaled to be as large as possible and 
'« fit within the available display area. 

The method may further comprise the step of organising the thumbnails into two or more groups in respective 
containment areas, and the groups may have a hierarchical structure. Still further, one or more groups laid out in 
respective containment areas may comprise two or more (sub)groups of thumbnails laid out in respective (sub)con- 
tainment areas, each (sub)containment area having the same aspect ratio as the parent containment area and the 
20 displayable area. 

In accordance with a third aspect of the invention, there is provided an apparatus for zooming digital images for 
use in image browsing, said thumbnails being displayable on a display module, the apparatus comprising: 

a device for laying out the thumbnails in at least one containment area having the same aspect ratio as an available 
2» display area of a display module; and 

a device for scaling the thumbnails contained in the at least one containment area to fit within the available area 
of the containment area. 
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Preferably, the apparatus comprises a computer and a computer readable medium having recorded thereon a 
computer program for zooming the digital images, the computer program canning out the method described above 
wherein Uie computer program can be loaded into and run by the computer to implement the apparatus 

In accordance with a fourth aspect of the invention, there is provided a computer program product having a com- 
puter readable medium having a computer program recorded thereon for zooming digital images for use in image 
browsing, the digital images each having a corresponding thumbnail, the computer program product comprising: 

a module for laying out the thumbnails in at least one containment area having the same aspect ratioas an available 
display area of a display module; and 

a module tor scaling the thumbnails contained in the at least one containment area to fit within the available area 
of the containment area. 



In accordance with a fifth aspect of the invention, there is provided a method of scaling thumbnails representing 
digital images, the method comprising the step of encoding a digital image using a hierarchical representation to provide 
a thumbnail. Further, the method includes the step of decoding the hierarchical representation of the digital image at 
a predetermined one of a plurality of scales to provide the thumbnail having a desired size. The scale of the decoded 
hierarchical representation may be equal to or greater than the desired size of the thumbnail. 

Still furtha®the method may include the step of downsampling the decoded hierarchical representation to provide 
the desired-size thumbnail If the scale of the decoded hierarchical representation is greater than the desired size of 
the thumbnail. AltemativeV, it may include the step of upsampling the decoded hierarchical representation to provide 
the desired-size thumbnail if the scale of the decoded hierarchical representation is less than the desired size of the 
thumbnail. The method may also Include the step of decoding the hierarchical representation of the digital image at 
two different adjacent scales of a plurality of scales to provide the thumbnail having a desired size The two decoded 
hierarchical representations having different adjacent scales can be interpolated to provide the desired-size thumbnail 
If the sc^^j^ne of the two decoded hierarchical representations is greater than the desired thumbnail size and the 
other of the two decoded hierarchical representations is less than the desired thumbnail size. The method may further 
»» comprise the step of displaying the thumbnail. 

Preferably, the encoding step includes the step of applying a hierarchical discrete wavelet transform to the image 
Advantageously the encoding comprises SWEET encoding. Compressed thumbnails and decompressed versions of 
thumbnails at particular sizes may be cached. Optionally, intermediate scales of decompressed thumbnails may also 
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be cached. 

In accordance with a sixth aspect of the invention, there is provided an apparatus for scaling thumbnails repre- 
senting digital innages, the apparatus comprising: 

5 means for storing a thumbnail, the thumbnail comprising a hierarchically encoded representation of a digital image; 

and 

means for decoding the hierarchical representation of the digital image at at least a predetermined one of a plurality 
of scales to provide the thumbnail having a desired size. 

70 In accordance with a seventh aspect of the invention, there is provided a computer program product comprising 

a computer readable medium having a computer program recorded thereon lor scaling thumbnails representing digital 
images, the computer program product comprising: 

means for storing a thumbnail, the thumbnail comprising a hierarchically encoded representation of a digital image* 
IS and 

means for decoding the hierarchical representation of the digital image at at least a predetermined one of a plurality 
of scales to provide the thumbnail having a desired size. 

Many other aspects of the invention and present disclosure will become apparent on review and understanding 
20 the following descriptions and claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

A number of embodiments of the present Invention will now be described with reference to the accompanying 
2S drawings in which: 

Fig. 1 shows an example configuratbn of a personal computer system of an embodiment; 
Fig. 2 is a schematic block diagram of the system of Fig. 1; 

Fig. 3 Illustrates a thumbnail management system including hardware and software Incorporating the arrangements 
30 of Figs. 1 and 2; 

Fig. 4 illustrates an initial and final stage of zooming up or down an image in accordance with one embodiment; 
Fig. 5A shows an example of Intermediate stages of an Image when zooming up; 
Figs. 5B and 5C illustrate alternate arrangements for zooming up; 
Fig. 6 illustrates an example of intermediate stages of an image when zooming down; 
35 Figs. 7 A and 7B schematically illustrates the steps in the zooming process according to a generalised embodiment; 

Fig. 8 is a schematic illustratbn of six groups of twenty-two thumbnails, each group in a respective containment 
area, in which one of the containment areas is scaled and laid out In accordance with the aspect ratio zooming 
process of an embodiment; 

Fig. 9 is a schematic illustration of the scaling of a group of thumbnails of Fig. 8 in accordance with the aspect 
40 ratio zooming process according to an embodiment; 

Figs. 10 to 12 illustrate disadvantageous aspects of using containment areas having different aspect ratios than 
that of the largest displayable area; 

Fig. 1 3 is a schematic illustration of the layout and scaling of several groups of thumbnails of Fig. 8 in respective 

containment areas having the same aspect ratio as the largest displayable area in accordance with the process 
45 of an embodiment; 

Fig. 14 is an Illustrative example of four groups of thumbnails organised in respective containment areas, each 

having the same aspect ratio as the largest displayable area, In accordance with the process of the an embodiment; 

Fig. 15 Illustrates aspect ratio zooming of one of the containment areas of Fig. 14, which in turn comprises (sub) 

groups of thumbnails organised in (sub) containment areas having the same aspect ratio as both the parent con- 
50 tainment area and the largest displayable area; 

Fig. 16 is a flow diagram illustrating the process according to the preferred embodiment; 

Figs. 17A-17D illustratethefastzoomprocessof Figs. 4 to 7 as applied to the containment areas of Figs. 14and 15; 

Fig. 18 Is a flow diagram Illustrating the encoding of a thumbnail In accordance with another embodiment; 

Fig. 19 is a flow diagram illustrating scaling of a thumbnail in accordance with another embodiment; 
55 Fig. 20 and 21 depict the use in combination of the embodiments of Figs. 1 to 9 and 1 3 to 1 9. 

Fig. 22 is a high-level block diagram illustrating the image representation technique according to the SWEET 

processes; 

Fig. 23 is a diagram Illustrating partitioning according to a SWEET process; 
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Fig. 24 is a flow diagram illustrating the method of representing, or encoding, an image according to a SWEET 
process; 

Fig. 25 is a detailed flow diagram illustrating the step of coding a region in Fig. 24; 

Fig. 26 Is a flow diagram Illustrating the method of decoding a coded representation of an image produced in 
accordance with the method Fig. 24; 

Fig. 27 is a detailed flow diagram illustrating the step of decoding a region in Fig. 26; 

Figs 28A to 28D are diagrams illustrating the processing of a two-dimensional, eight-coefficient region in accord- 
ance with the encoding and decoding method of Figs. 24 to 26; and 

Figs. 29 to 32 are flow diagrams illustrating the method representing, or encoding, an image according to an 
alternate SWEET method. 

DETAILED DESCRIPTION 

Figs. 1 and 2 Illustrate a representative embodiment of such a personal computer system 900 for Implementing 
the processes to be described. The computer system 900 Includes a computer 901 . and a video display monitor 904. 
It may also include one or more input devices 902,903. User inputs to operate the computer 901 are provided by the 
input devices. For example, a user can provide input to the computer 901 via a keyboard 902 and/or a pointing device 
such as the mouse 903. The computer system 900 may be connected to one or more other computers using a com- 
munication channel such as a modem communications path, a computer network, or the Internet, for example. Further, 
any of several types of output devices including plotters, printers, laser printers, and other reproduction devices may 
be connected to the computer module 901 . 

The computer module 901 has one or more central processing units (CPU or processor) 905, memory 906 including 
random access memory (RAM), static RAM or cache, and read-only memory (ROM), a video Interface/adaptor 907, 
an Input/output (I/O) interface 910, and storage device(s) generally depicted by block 909 in Fig. 9. A video interface/ 
adaptor 907 connects to the video display monitor 904 provides video signals from the computer module 901 for display 
on the video display monitor 904. Optionally, an Image or video capture device 920 (eg,, a digital camera) can be 
connected to the computer 901 via the interface 910 as a source of digital images or video sequences that can be 
represented by thumbnails in accordance with the described embodiment. 

The storage device(s) 909 may comprise a floppy disc, a hard disc drive, a magneto-optical disc drive, magnetic 
tape, CD-ROM and/or any other of a number of non-volatile storage devices. The components 905 to 910 shown In 
Fig. 9 ar^ coupled to each other via a bus 911. The bus 911 in tum can comprise data, address, and control buses. 
The overall structure and Individual components of the computer system 900 is essentially conventional and would be 
well known to persons skilled in the art. Thus, the system 900 Is simply provided for illustrative purposes and other 
configurations can be employed without departing from the scope and spirit of the invention. The computer system 
may be implemented using an IBM personal computer (PC) or compatible, one of the Macintosh (TM) family of PCs, 
Sun Sparcstation (TM), or the like. The foregoing are merely exemplary of the types of computers with which the 
embodiments of the invention may be practiced. 

In Fig. 3, representative components of the computer system 900 are shown which include a hardware system 
925, an operating system 930 and application software 935. Typically, the hardware system 925 includes the physical 
components of the computer system 900 described above with reference to Figs. 1 and 2, The operating system 930 
traditbnally includes management software applied within the computer system 900 for performing various tasks and 
handling the operation of the hardware. The application software 935 includes specific routines configured for perform- 
ing user desired tasks. The operating system 930 and application software 935 may reside in a permanent storage 
device 909 or in the memory 906 or be temporarily loaded from the device 909 to the memory storage 906. 

As illustrated, the operating system 930 includes an input device management system 931 which receives user 
instigated control signals typically from the keyboard 902, via a keyboard interface 926, and the mouse 903 via a mouse 
interface 927. The input device management system 931 processes the user's commands and transfers those to the 
application software 935 being operated at the time, in this case being thumbnail management software 936. As seen, 
the thumbnail management software 936 includes an output to a file system 933 incorporated within the operating 
system 930. The file system 933 provides for access to files and folders which may be stored on a disk drive 929 
accessed via a disk I/O interface 928 each within the hardware system. This drive 929 will be appreciated can comprise 
a part of the storage device 909 shown in Fig. 2. Further, the thumbnail management software 936 can output image 
^?S^i£"®"^® ® drawing management system 932 which in tum provides for the display of images on the display 904 
viia me video interface 907. Thus, for example, as shown in Fig. 1, the thumbnail management system may provide 
for the display of thumbnail images 940 which are Included as components as a Interactive graphical user Interface to 
a file management system 942 operating within the computer system 900. 

In this embodiment, the thumbnail management system incorporates a number of components, one being a sizing 
component 937 which provides for the zooming of Images and in particular thumbnail images, a grouping arrangement 
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938 which provides for the coordination of groups of thumbnail images, and a aspect ratio component 939 which 
provides for the coordination of the aspect ratio of varbus thumbnail groups. 

Fast Zooming of Images 

In dealing with images In general and colour thumbnail Images in particular, it is desirable for those Images to be 
zoomed-in and zoom-out rapidly when using file management systems without overburdening the processor 905 with 
excessive graphical and object and/or pixel-based image manipulations. 

Referring to Fig. 4, there Is shown a first image 10 as displayed for example on the display device 904 at a first 
predetermined size and which is intended to be displayed as a second Image 11 at a second predetermined size. In 
many Instances, the first and second sizes are variations (typically reductions) of the true image which, when correctly 
and fully displayed may occupy a full video display of say 1024 x 768 pixels at 24 bits of RGB colour per pixel. This 
true image is traditionally stored on the disk storage device 909 in a compressed format, for example using the JPEG 
standard. Typically, the second Image 11 corresponds to the first Image, such as a thumbnail image, but at another 
resolution scale. For example as illustrated, the first image 10 Is a 4x 4 pixel image which is enlarged through at least 
one intermediate displayed size (not shown in Fig. 1) to display substantially the same Image at a 16 x 16 pixel image 
resolution (ie: the image 11). 

Often, a "Thumbnail" size image Is referred to as being a smallest one of a number of predetermined sizes of a 
desired Image. The first image 10 of Fig. 4, for example. Is referred to herein as the "Thumbnail" size image and the 
second Image 11 is designated the desired Image. Shown in Fig. 4 Is a blKJirectional arrow 12 representing that a 
thumbnail size Image 10 can be zoomed to a desired image 11 or a desired image can be zoomed to a thumbnail size 
Image as hereinafter described. 

A thumbnail size image 1 0 and a desired image (eg. 1 1 ) can be stored in a memory device In a compressed format 
(representation), and preferably the thumbnail size image and the desired image are various resolutions of a single 
compressed Image. In various Image compression representations, a multitude of resolutions can be extracted from 
a single image stored on a memory storage device. As an example, an image compressed through wavelet coding 
allows an extraction of a plurality of resolutions of the compressed image. A wavelet compressed image can be thought 
of as a bit stream of ones and zero. By applying a decompression algorithm to the entire bit stream, the wavelet 
compressed Image Is substantially recovered (ie. wholly decompressed). However, by applying the decompression 
algorithm to a first portion of the bit stream, a lower resolution version of the wavelet compressed image can be recov- 
ered and thus, by applying the decompression algorithm to a greater portion than the first portion of the bit stream, a 
greater resolution version than the lower resolution version can also be obtained. An example of such a wavelet com- 
pression/decompression arrangement is discussed In detail later In this specification under the heading "SWEET - A 
Method For Digital Image Compression". Other examples of compression techniques that can be adapted to extract 
a plurality of resolutions of a single compressed image include JPEG (Joint Photographic Expert Group) compression. 

Referring to Fig. 5A, there is shown a thumbnail size image 20 which is extracted at a first resolution from a 
compressed true image and displayed on a display device. The thumbnail size image 20 is to be enlarged to a desired 
Image 21 at a predetennlned size greater than the thumbnail size image 20. The thumbnail size Image 20 is to be 
enlarged through a plurality of Intermediate step sizes 22,23,24 to the final predetermined size of the desired image 
21. In the present example, the intermediate sizes 22,23.24 are achieved by pixel replication. Each pixel 25 of the 
thumbnail size image 20 is replicated (reproduced) four times to obtain a block 26 of four pixels for a first intermediate 
size 22 of the image 20. Each pixel of the block 26 of the first Intermediate size 22 Is replicated four times to produce 
a block 27 of four pixels of a second intemiediate size 23 of the image 20. At this stage therefore, the original pixel 25 
has been replicated to form sixteen pixels. This block replication process is repeated until a predetermined final size 
is achieved. Preferably, the predetemnlned final size is the same size (dimensionally) as the desired image 21 . In this 
connection it is noted that the images 20 - 24 of Fig. 5A are not drawn to scale and the relative depiction of the pixels 
are for illustrative purposes. At this stage the desired image 21 Is decompressed (extracted) from the compressed 
Image and is displayed at the predetermined final size of the image 20. At least one of the intermediate stage sizes 
22,23,24 Is preferably displayed on the display device to give a visual appearance of "zooming-in" from a thumbnail 
size Image 20 to a predetermined final size of a desired Image 21. 

In this fashion, zooming-tn on a thumbnail can be achieved by pixel replication and is preferably performed whilst 
decompression to the desired size/resolution of the true Image is taking place so that once Instigated by the user, for 
example by clicking the mouse 903, the zoom "appears" to occur as the replicated stages are re-displayed. On com- 
pletion of the zoom however, a more accurate, non-replicated but decompressed Image is thus displayed. Thus, only 
one image (ie. the desired final image) need be decompressed and so only one decompression time is required to 
achieve and display the zoom-in effect. During that time, the user is presented with a progressively staged zoom of 
the rudimentary thumbnail Image which, whilst not of high image quality, provides the user with a transitory progression 
between image sizes thus assisting In maintaining user orientation within the file/folder structure during browsing op- 
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eratlons. 

The pixel replication process described above Is made with reference to a 2 x 2 (4 times) increase In size of the 
Intermediate stage sizes. However, increasing the intermediate stage sizes at other than multiples of four can be op- 
tionally performed, as Illustrated In Figs. 5B and 5C. 

As an example of stretching the thumbnail image 20 by pixel replication, at other than by a multiple of four, an 
Interpolated value between two adjacent pairs of pixels in the thumbnail image 20 can be obtained and the Interpolated 
value is replicated three times to produce three output pixels for each dimension of a first Intermediate size. This results 
in an appearance that the thumbnail size Image 20 has Increased In size by 1.5 for each dimension (1 .5 x 1 .5). 

As seen In Fig. 5B, a 4 x 4 thumbnail Image 28A Is provided having pixel values a.b.c, ... p. In this example, it is 
desired to stretch the thumbnail image 28A by a factor of 1 .5 in each dimension to provide a 6x6 Intermediate stage. 
As noted above, adjacent pairs of pixels of the thumbnail image are Interpolated. In this fashion, adjacent row pairs of 
pixel values "a' and "b" are Interpolated to provide a pixel value "1", which is then replicated three times. Similarly 
pixel values "c" and "d" are Interpolated to give an interpolated value of "2" which is also replicated three times. This 
process Is applied to each adjacent pair of pixels in each row of the thumbnail image 28A to provide a 6 x 4 array 28B 
of pixel values 1 ... 8. In a similar fashion, but working with adjacent values in each column, interpolation proceeds. 
Pixel values 1 and 3 are interpolated to provide a pixel value 9 which is replicated three times. This occurs for each of 
those pixels in that combination of values. Thus, a 9 x 9 array of pixels 28C Is formed which comprises four Interpolated 
values, 9, 10. 11 and 12 spread amongst 36 pixel locations. 

With reference to Fig. 5C. an alternative method can be provided whereby all adjacent pixels are interpolated. As 
shown in Fig. 5C. an input 4x4 array of pixels 29A comprising pixel values a, b. c ... p is provided. An Interpolated 
value T Is obtained from input values a and b and is replicated three times as illustrated. An interpolated value 2 is 
obtained from adjacent pixels b and c and also replicated three times. Similarly an interpolated value 3 is obtained 
from adjacent pixels c and d and also replicated. This process proceeds for each row of the input array to reveal a 9 
X 4 array of pixels of Interpolated values 29B. In a corresponding manner to the arrangement of Fig. 5B, the array of 
pixels 29B is further interpolated based upon the adjacency of the values such that a value 1 3 is obtained by Interpolating 
values 1 and.4 and Interpolated value 18 Is obtained by Interpolating the values 4 and 7, and so on. Thus, it will be 
appreciated using this approach that a 9 x 9 array 29C of pixels can be obtained from a 4 x 4 array, giving an enlargement 
factor of 2.25. 

Further, whilst the foregoing has been described in relation to the interpolation of adjacent and pairs pixels, other 
mathematical processes may be applied to obtain the desired result. For example, adjacent pixel values may be av- 
eraged or,blends provided between Intemiediate new pixels. However, it will be appreciated that the complexity of 
some mathematical operatbns will Increase the computational time required to determine the intermediate zooming 
stages. As the intermediate zooming stages are typically quite transient, the need for higher quality is generally not 
necessary as ail that is required Is for the user to be given the impression of a particular thumbnail being enlarged by 
zoomlng-ln. Thus, distortions of the Image due to pixel replication/averaging etc. are tolerable as It Is the original image 
as decompressed which results and which leaves the greatest lasting impression upon the eye of the user. 

With reference to Fig. 6, a desired image 30 is to be contracted (shrunk in size) through a plurality of intermediate 
stage sizes 31,32.33 to a thumbnail size image 34. to produce a visual effect of "zoomingouf. The "zooming-out* 
technique is achieved by decompressing the true image, from a compressed image fomiat, to provide a desired image 
30 at a first Image size (dimensbn), and a first resolution. Then an average for a plurality of pixels 35 is obtained to 
produce an average pixel value, the average pixel value being assigned as a single pixel of a first intermediate stage 
size 31 of the desired image 30. In the illustrated example of Fig. 3, each group of four pixels 35 of the desired image 
30 is averaged to produce a single pixel 36 of the first intemnediate stage 31. A group of four pixels 36 of the first 
intermediate stage size 31 of the desired image 30 is averaged to produce a single pixel 37 of a second intermediate 
stage size 32. This process Is repeated for each four pixel grouping of the first intermediate stage 31 until each pixel 
of the second intermediate stage size 32 of the desired image 30 is produced. The process of averaging four pixels to 
produce a pixel of a next intermediate stage size 33 is repeated until preferably a thumbnail size 34 of the desired 
image 30 is reached. Once the thumbnail size 34 image Is substantially achieved, a low resolution image may be 
extracted from the compressed Image and displayed at the thumbnail size 34. Again, as with the pixel replication 
process described with reference to Figs. 5B and 5C, the desired image can be reduced in size by other than a factor 
of four Averaging, for example, twenty-five pixels of one intermediate stage size to achieve four pixels of a next Inter- 
mediate stage size, results in a reduction size ratio of 6.25 :1 . 

An altemative option for shrinking the size of an image Is to sub-sample the image. That Is, one of every n number 
of pixels of an Image Is selected as a pixel to form a reduced image. The reduced image having a reduction (size) ratio 
/7:1. 

Referring now to Fig. 7A, a schematic flow diagram illustrates steps for zooming-ln or zooming-out an image In 
accordance with the embodiments of the present invention, which are preferably implemented as software operating 
within the sizing module 937 of the application software 935. 
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channel between the computer and another device, a computer readable card such as a PCMCIA card, a network 
connection to another computer or networked device, and the Internet and Intranets including e-mail transmissions 
and information recorded on websites and the like. The foregoing are merely exemplary of relevant computer readable 
media Other computer readable mediums may be practiced without departing from the scope and spirit of the invention. 

Fig. 1 6 is a flow diagram illustrating embodiments of the aspect ratio zooming process 1 20 preferably implemented 
by the software module 939 of Fig. 3. Processing commences at step 121 in which the thumbnails, organised in one 
ornrxjre groups, are laid out in respective containment areas. Each containment area may in turn contain (sub) con- 
tainment areas and has a predetermined aspect ratk>. The aspect ratio is the ratio of the width to the height of the 
largest displayable area available (which may be the entire screen, a portion of the screen, or a window, for example) 
on the display device. This is described hereinafter in greater detail with reference to the Figs. 8 to 1 5. In step 1 23, the 
thumbnails are scaled in each containment area to fit with the available area of the containment area. It will be appre- 
ciated that this fonms a hierarchical, tree-like structure of sorts, In which one containment area may form the uppermost 
containment area within the displayable area. Steps 121 and 123 constitute the essential steps of this embodiment. 
The process may constitute these steps alone, but optionally also include steps 124 to 126. 

Preferably, a check is made in decision block 1 24 to determine if a user selected containment area Is to be zoomed 
in. If decision block 1 24 returns true (zoom-in), processing continues at step 1 25. In step 1 25. the selected containment 
area is enlarged (zoomed up) to the available display area, and this Is likewise carried out for any (sub)containment 
areas with respect to the relevant parent containment areas. Execution then continues at step 121. where the aspect 
ratio zooming process is applied in steps 121 and 123. 

Otherwise, if decision block 1 24 returns false (zoom-out), processing continues at step 1 26. In this step, the reverse 
or complementary process of step 125 is carried before processing retums to step 121 . That is. a parent containment 
area of the selected group Is set to the available display area, and likewise the selected containment area is reduced 
in size, along with its thumbnails. 

The process of aspect ratio zooming-in this embodiment overcomes problems of user navigation and orientation 
within the image space. Fig. 8 shows an example of an entire image space containing a total of 132 Image thumbnails 
divided into six distinct groups. The image thumbnails are represented by grey numbered squares. A primary contain- 
ment area 100 represents the entire space available for the display of image thumbnails, for example, on a video 
display monitor. In general, the primary containment area defines the aspect ratio of the display Contairiment areas 
102. 104. 106, 108, 110 and 112 provide visual segregation for the six groups of Images within the image space. For 
this example, each group contains 22 image thumbnails and displayed at unifonn size and scaled to the largest possible 
size that pemnits the group to fit within the respective containment area. As seen in Fig. 8. containment area 1 08 is the 
only area that has the same aspect ratio as the primary containment area 100, in this case a ratio of about 1 .320. 

During nomial browsing or searching activity, users typically scan the entire search domain (in this case, the entire 
image space) and then refine their search by choosing to take a closer took at perceived areas of Interest. In this case, 
the user may choose to zoom-in on the images displayed in one of the containment areas. Containment area 108 as 
Illustrated has been laid out in accordance with the process of the preferred embodiment. Only the images in contain- 
ment area 108 (which has the same aspect ratio as the primary containment area 100) can be zoomed up to fill the 
primary containment area 100 (ie., the entire space available for the display of images) without causing orientation 
problems for users. 

Fig. 9 Illustrates the process of aspect ratio zooming the thumbnails of containment area 108, having the same 
aspect ratio as the largest displayable area, in accordance with the process of the preferred embodiment. Thus, the 
images in containment area 108 can be enlarged to fill the primary containment area 100 without being rearranged. 
This is important since it provides a consistent spatial representation of the image space. In contrast, zooming up one 
of the other image containment areas 102. 104. 106. 110 or 112 of Fig. 8 (so that the image thumbnails within that 
group are made as large as possible to fill the primary containment area 100) results in the images being rearranged 
as Is described hereinafter with reference to Fig. 10. 

Fig. 10 shows how the images in containment area 106 have to be rearranged If they are zoomed up to fill the 
primary containment area 100. Altematlve methods of zooming up a containment area with a different aspect ratto to 
the primary containment area 1 00 are shown in Figs. 1 1 and 1 2. Both of these necessitate the use of scroll bars 1 1 4, 1 1 5 
(or similar controls) and lead to orientation and navigational problems for users. The useability problems associated 
with using scroll bars are known. 

Fig. 1 3 shows how the thumbnails of Fig. 8 can be rearranged in accordance with the aspect ratio zooming process 
of the preferred embodiment. Each of the containment areas 102 to 112 of the six thumbnail groupings is laid out to 
have the same aspect ratio as the principal containment area 1 00. In this manner, all six image groupings are displayed 
in areas of the same aspect ratio as the primary containment area. 

Fig. 1 4 illustrates an example of the aspect ratto zooming process extended so that image containment areas with 
fixed aspect ratios can display other image containment areas with the same aspect ratio. The containment areas 
containing these groupings may also in turn comprise (sub) containment areas. Thus, a hierarchical structure of con- 
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tainment areas and (sub) containment areas is provided. The structure of the image space is depicted in the (Windows 
or Internet) •Explorer'(TIVI) type tree view 1 30 on the left hand side of the screenshot or image. Each of the containment 
areas is depicted as a folder in the tree. The largest displayable area or principal containment area is labelled "Library" 
1 32. and it contains four groupings of thumbnails labelled "Australia" 1 34, "Transport" 1 36, "Images of the World" 1 38 
and "People" 140. 

Whereas the containment area 1 34 labelled "Australia" contains only image thumbnails, the containment area 1 40 
for "People" contains further image thumbnail groupings ("Children" 141, "Corporate people" 142, "Couples" 143 
"Sports people" 144), This equally applies to the containment areas "Transport" and "Images of the World". In this way.' 
the aspect ratio zooming supports the meaningful and concurrent display of an entire image space. 

Fig. 1 5 illustrates the enlargement of the "People" containment area 1 40 so as to fill the available display space 
(formerly labelled "Library"), Having had a closer look at the contents of the "People" group, a user can choose to 
2oom-in-furtheronanothercontainmentarea(for example, "Sports people" 144containsfourfurther groups 145-148) 
or zoom back out. In this way. users are able to reduce or enlarge any containment area to fill the display space available 
while niaintaining context and spatial reference. 

The process of aspect ratio zooming according to the preferred embodiment is advantageous in that it overcomes 
disadvantages of the prior art. This includes the technological problems associated with scaling images up and down 
at adequate speed. Zooming (moving from displaying a group of images and/or containment areas, to displaying the 
contents of one containment area) can be perfomied in a number of ways: 

1 ) Switching displays: simply switch from the initial state to the zoomed in state, which is advantageous in that it 
is very fast; 

2) Zooming with outlines: drawing the outline of each image or containment area being zoomed at each zoom 
step, which IS also very fast, and gives the user some feel for the zoom process; 

3) Accurate zooming: performing a number of intemfiediate zoom steps, and redrawing the display accurately at 
each step, which is advantageous in that each zoom step kxjks appealing; 

4) Bitmap scaling: scaling up (or down) a bitmap of the whole (or part of)' the primary containment area 100, to 
each zoom step, and then accurately redrawing at the end of the zoom; this is very fast and gives the user a good 
feel for the zoom. 



Scaling the bitmap" in above item (4) is easily performed if the bitmap of the area being zoomed into has the same 
aspect ratio as the viewing area. If it does not have the same aspect ratio, the zoom steps do not correspond with the 
final resulting display. An example of such zooming is the fast zoom described above with reference to Figs. 4 to 7B 

After the last zoom step, the display contains the "scaled bitmap". By drawing the background qutekly, but leaving 
the scaled images untouched, a good approximatkm of the final display may be obtained. Each image may be redrawn 
properly one at a time while allowing the user to interact with the program. The longer the user waits, the better is the 
quality of the display (until all the images are redrawn). 

A further and preferred implementatton is illustrated in Figs. 17A-17D which combines aspect ratio zooming with 
an alternative implementation of the previously described fast zooming of thumbnails. In particular, the present inventors 
have determined that image-by-image thumbnail stretching as may be required when zooming a containment area can 
be excessively time consuming particularly with the need to redraw the background of the containment area with each 
intemnediate stage. This difficulty is overcome by selecting the entire containment area (window) and processing that 
area as if it were a single input image whereby the entire containment area is stretched/shrunk including text and 
images, and when the final size is obtained using the fast zooming process previously described, the entire containment 
area/window is redrawn including background, title and individual images. This is depfcted in Figs. 17A-17D where it 
will be appreciated that Fig. 17A is a schematic representation of the library window 1 32 seen previously in Fig 1 4 In 
this embodiment, it is desired to zoom-in upon the "people" containment area 140 to reveal the image shown in Fig 
1 5, Fig. 17A shows the library display in its initial form prior to the commencement of the zoom. In Fig 1 7B the people 
containment area 140 has been zoomed to a first intemiediate size and is shown illustrated commencing to occlude 
the other containment areas. Fig. 17C shows a further transitional stage where the people containment area 140 
substantially occludes the remainder as the zoom proceeds. Fig. 17D shows the configuration of the display at the 
completion of the zoom. In the intermediate displays of Fig. 178 and Fig. 17C the people containment area 140 is 
displayed using pixels replicated from the people containment area shown in Fig. 17A. In Fig. 17D however each of 
the individual thumbnails and sub-containment areas 141-144 are decompressed to reveal the desired higher quality 
images. a -i / 

The process of aspect ratio zooming according to the preferred embodiment is also advantageous in that It provides 
a new and superior image browsing system given accepted wisdom regarding database navigation and the principles 
of browsing. The process also complies with the guiding principle for the design of database navigational systems 
which IS to make users aware of the entire contents of the,search domain and to facilitate rapid examinatfon of any 
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individual item in greater detail (eg, see Spence, R. and Apperly, M.D. (1982) "Database Navigation: An Office Envi- 
ronment For the Professional". Behaviour and Information Technology, 1,1. pp 43-54). 

The process also exploits the remarkable (and well documented) spatial memory and visual scanning capabilities 
of humans to assist in the browsing process (rather than simply relying on recall through prior classification) Still further 
It IS advantageous in that it provides continual reinforcement as to the structure and layout of the image space which' 
albeit unconscious, is generally considered to enhance spatial memory. It also capitalises on the almost limitless ca- 
pacity of human recognitbn memory (eg, see Standing, L. (1973) Teaming 10,000 Pictures-, Quarterly Journal of 
Experimental Psychology. 25, pp. 207-222) and the highly developed skills for pattern recognition by displaying all 
thumbnails in the search donrain at maximum size possible while maintaining context and providing consistent spatial 
r©iQr©nc©. 

tK '^"^^ T'- ^"^^^ ^^PP^ ""^"^S judgments about the content of an image from a thumbnail smaller 
than 80 X 8(>pixels. image thumbnails as small as small as 6x6 pixels can nonetheless be easily distinguished from 
each other. Further, in combination with foreknowledge of the image content, the human perceptual system is able to 
recognise irnages from thumbnails displayed as small as 14 x 14 pixels. Consequently, for the purposes of browsing 
(as distinct from assessing content) considerable value can be achieved by displaying thumbnails at much smaller 
sizes than they are displayed using prior art techniques. 

Compressing and Scaling Thumbnails 

This embodiment is directed to a method, apparatus, and computer program product for encoding and scaling 
thumbnails; That is. image thumbnails are generated at vaiying resolutions or sizes. The various resolutions or sizes 
may be predetermined, which can be adjusted to a different size than that of the predetermined one Thumbnails 
provided in accordance with the preferred embodiment are particularly useful in applications for image browsers used 
to navigate large databases of digital images. While the use of thumbnails produced in accordance with the preferred 
embodiment has particular application to browsing image databases, it will be apparent to those skilled in the art that 
such a technique can be advantageously practiced in other applications without departing from the scope and spirit of 
the invention Ir. the folkwing description, numerous speciffc details, such as the size of image thumbnails, application 
software embodying the process, screen displays, arrangements of thumbnails, etc. are described in detail to provide 
a more thorough description of the present invention. It will be apparent, however, to one skilled in the art that the 
present invention may be practiced without these specific details, in other instances, well-known features are not de- 
scribed in detail so as not to unnecessarily obscure the present inventran. 

Oven/iew 

The process of compressing and scaling a thumbnail in accordance with the preferred embodiment enables the 
display of niany small thumbnails or image icons on the one hand, when necessary, and allows such thumbnails to 
also be displayed as large as possible on the other hand. In a browsing application where elements of a database are 
n!^lH^t!T « thumbnails, the encoding and scaling process can be advantageously emptoyed To optimally meet the 
noted tradeoff, thumbnails preferably fill a display window when displayed and do not unnecessarily leave significant 
portions of a display window unused. As a rough approximation, the thumbnails fill a constant "image data" regardless 
of the number of thumbnails displayed. If there are many images to be displayed, the thumbnails are made small while 
If there are only a few thumbnails to be displayed they are made larger The idea of a constant image area is only 

nSmbel ^ °' '^^"^ °" *® ^'^^ '^'^^ '"'^S® s*^® 

The process of the preferred embodiment also provides compression to minimise storage requirements for such 
thumbnails, E)gg small images require a significant amount of space and hence a large number of thumbnails can 
use a significant amount of memoiy. Compression is also desired for memory effteiency. It is often desirable that many 
thumbnails are stored in memory, ratherthan on disk, forfaster access. Memory is nonnally at a premium and therefore 
the process of the preferred embodiment stores the thumbnails in memory in a compressed format 

In this process, a hierarchical image icon representation is used to meet the "constant area'/variable imaqe size 
requirement. To use compression, relatively fast decompression is required so that the image icons are displayed in 
an acceptable amount of time. To display a "constant area" of images in substantially the same amount of time the 
process ^jMpreferred embodiment provides a decompression time that is substantially proportional to the size of 
the imagrWclisplaying the different types of "constant area" image sets in the same amount of time, the process 
ensures that each image set is displayed in an acceptable amount of time. II each image set is displayed in substantially 
the same amount of time, a "worst case" display time problem can be minimised or eliminated using this process Thus 
all image sets are displayed as quickly as possible. nius. 
To meet the foregoing requirements, the process of the preferred embodiment utilises an efficient hierarchical 
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discrete wavelet transform (DWT) coded representation. DWT compression techniques offer good Image compression, 
and the coding of the subbands produced by the DWT in terms of increasing frequency offers a hierarchical represen- 
tation without substantially sacrificing compression efficiency (ie., amount of compression). Still further, coding of the 
subbands in such an order offers decompression times proportbnal to the size of the decoded image, and independent 
of the size of the original image. Using a suitable DWT compressed representation also has another advantage. The 
image Icons can actually be decompressed at a size greater than the size of the original compressed image icon. This 
Is simply achieved using the inverse DWT engine within the decompression code. The process of the preferred em- 
bodiment utilises the SWEET compression technique which is particularly suitable for this compression application. 
The SWEET technique Is disclosed In Australian Provisional Application No. PO 4728 filed on 22 January 1997 and 
entitled "A Method for Digital Image Compression", the relevant disclosure of which is reproduced below under a 
corresponding sub-heading. Compression and decompression using SWEET are both very fast. Also, the subbands 
are coded in order of Increasing frequency and Independently. Further, SWEET provides high compression efficiency 
Because the process of the preferred embodiment uses a discrete wavelet transform coding, It can decode small 
resolution versions of images with a decoding time linearly dependent on the size of the decoded Image. Therefore, a 
constant image "area" can be decoded independent of the size of the images that constitute an area. An efficient 
hierarchical DWT representatbn meets the desired requirements of fast decompression and hierarchical size decom- 
pression with decompression speed substantially proportional to the size of the decoded image. Further, this hierar- 
chical representation can be achieved without sacrificing compression efficiency and in fact compression can be ob- 
tained competitive with state of the art techniques. Finally, using a DWT representation, the inverse DWT engine can 
be used to display the image icons at a size larger than the original compressed size. 

Exemolan/ Environment Using Thumbnails 

As discussed above Fig. 1 3 is a block diagram illustrating six groups 102-112 of thumbnails displayed on a display 
device 1 00 (eg. , a computer display monitor). Each of the exemplary groups 1 02-11 2 contains 22 thumbnails numbered 
accordingly. While particular groupings and numbers of thumbnails are shown, the embodiments of the Invention are 
not so limited, and the example of Fig. 13 is provided for ease of description with each thumbnail depicted as a grey- 
filled block. 

As discussed above, each thumbnail Is a visual representation, preferably of a corresponding Image file. In the 
case of Image files, each image file has a reference to it as well as a con-esponding thumbnail. The thumbnails or 
image icons provide an indication of the Image they represent, and are small versions of the images. References to 
the image files are stored in a database (although this could easily be extended to non-image files), and may be 
acquired from a file system, or from an input device such as a digital camera or a scanner When an image file is added 
to the database system, a reference to the Image file is stored, and a thumbnail is created for the image. While the 
preferred embodiment uses thumbnails that represent still images, the invention is not limited to this particular appli- 
cation. Instead, thumbnails using the method of the preferred embodiment described below can be used to represent 
other things including video sequences, a document for which an icon can be made, etc. 

As shown In the screenshot of Fig. 1 4, each of the thumbnails Is a graphical representation of an image. Dependent 
upon the particular application involved, the arrangement of the thumbnails can reflect a number of things: the semi- 
hierarchical categorisation of the Images into folders (a directed acyclic graph with a single root, but only the leaves, 
ie.. the images themselves, can be multiply referenced); a display of the Images brought Into the system but not yet 
categorised: or the results of some automated search through the Images In the system. Large numbers of images 
may be stored In the system. For example. Image databases may not uncommonly contain ten thousand images. In 
fact the only limitation on the size of the image database is the storage capacity and performance of the computer 
being used. 

In this example, the top-level display logically shows all the categorised images. For each internal node of the 
graph (each folder) shown on the left-hand side of Fig. 14, a rectangular array of the thumbnails is displayed in the top 
left comer and at the same size for alt the leaves directly connected to this node. In the remaining space, an outline is 
laid out of every internal node that depends from this node. This process is repeated recursively (although at smaller 
scales) within each outline, ie. for each internal node. The hierarchy of thumbnails is navigated by zooming-lnto and 
out of the folders In the exemplary application. Because a directed acyclic graph is being represented, a given image 
may also appear several times in the hierarchy Since the places where the Image appears may be at different levels 
of the hierarchy, several thumbnails may simultaneously need to be displayed for the image at different sizes. 

Encoding and Scaling Process of Preferred Embodiment 

To reduce the memory requirements for potentially large numbers of thumbnails and to provide rapid scaling of 
thumbnails, the preferred embodiment as depicted in Fig. 18 provides a method 150 of encoding thumbnails, where 
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images are compressed and the form of compression enables rapid scaling of the thumbnail using a hierarchical rep- 
resentation. Preferably, in step 154, one or more thumbnails are generated by compressing respective images using 
a hierarchical discrete wavelet transform (DWT) compression technique. More preferably, the hierarchical DWT com- 
pression technique employed is the SWEET technique. 

Fig. 19 is a flow diagram Illustrating the process of scaling a thumbnail in accordance with the preferred embodi- 
ment. In particular, the flow diagram illustrates the process 160 of decoding thumbnails, where each thumbnail com- 
prises a hierarchical representation of a digital Image, at a predetermined one of a plurality of scales to provide the 
thumbnail for display having a desired size. Processing commences in step 1 62. In step 164, a thumbnail comprising 
the hierarchical representation of an image Is decoded for the purpose of producing a desired size or scale thumbnail. 
The thumbnail is decoded to the scale closest to the desired size of the thumbnail for display The scale of the decoded 
hierarchical representation may be equal to the desired size of the thumbnail. If it is not, the decoded hierarchical 
representation is preferably larger than the desired size. Again, the hierarchical representation is preferably an image 
compressed using the SWEET compression technique. In decision block 166, a check is made to determine if the 
scale of the decoded thumbnail equals the desired size to be displayed. If decision block 1 66 returns true (yes), process- 
ing continues at step 170. Otherwise, if decision block 166 returns false (no), processing continues at step 168. 

In step 1 68, the size of the decoded hierarchical representation is adjusted to the desired size of the thumbnail to 
be displayed. Preferably, the adjustment step involves uniformly scaling down or downsampling the decoded hierar- 
chical representation to provide the desired-size thumbnail If the scale of the decoded hierarchical representation is 
greater than the desired size of the thumbnail. Scaling of the thumbnail is preferably always down, and by less than a 
factor of 2 in each linear dimension. In operating environments such as Microsoft (TM) Windows (TM), in-built Windows 
functions may be used to do this, giving the system the opportunity to optimise the operatton as best It can. 

In step 170, the decoded hierarchical representation is displayed as the thumbnail at the desired size. Coded and/ 
or decoded hierarchical representations (eg, compressed and/or decompressed thumbnails) are preferably stored in 
a cache or memory of thumbnails for later rapid retrieval and display Optionally, because the process of decompressing 
the hierarchical representation to the desired scale can involve decoding several intermediate scales or sizes of the 
hierarchical representatkxi, the intermediate scales may also be stored in the cache or memory of thumbnails. For 
example, a decoded thumbnail displayed at a size of 64 x 64 pixels may subsequently be displayed at a size of 1 6 x 
16 pixels. If so, an earlier decoded thumbnail of this size may be retrieved from the cache, where the smaller size 
thumbnail was obtained in the process of decoding the 64 X 64 pixel thumbnail. Processing terminates in step 172. 

While steps 1 64 and 168 preferably involve downsampling to adjust a (larger size) decoded hierarchical represen- 
tatbn to the desired size, other techniques of doing so may be employed without departing from the scope of the 
Invention. For example, the method of Fig. 19 could alternatively involve decoding the hierarchical representation to 
a scale smaller than the desired thumbnail size and upsampling the decoded hierarchical representation to provide 
the desired-size thumbnail. As a further alternative, the method may involve decoding the hierarchical representation 
in step 1 64 at two different adjacent scales or sizes (eg, 64 X 64 pixels and 32 x 32 pixels) to provide the desired size 
thumbnail. In step 168, interpolation can be carried out between the decoded hierarchical representations to provide 
the desired-size thumbnail. In this case, one of the scales of the decoded hierarchical representation is greater than 
the desired thumbnail size and the other scale is less than the desired thumbnail size. 

Returning to Fig. 15 is a further exemplary screenshot illustrating the zoomed up display of thumbnails or image 
icons of the People group of Fig. 14. The scaled up versions of the thumbnails shown in Fig. 15 may be obtained in 
accordance with the process of Fig. 1 9. 

Preferably, the process described above is implemented with the thumbnail encoding and scaling prxxjess embod- 
ied as software or a computer program (eg, within a browser application) recorded on a computer readable medium 
and loaded Into the computer system 900 from the computer readable medium. Examples of such computer readable 
medium include a floppy disk, magnetic tape, a hard disc drive, a ROM or integrated circuit, a magneto-optical disk, a 
radio or Infra-red transmission channel between the computer and another device, a computer readable card such as 
a PCMCI A card, a network connection to another computer or networked device, and the Internet and Intranets including 
e-mail transmissions and information recorded on websites and the like. The foregoing are merely exemplary of relevant 
computer readable media. Other computer readable media may be practiced without departing from the scope and 
spirit of the invention. 

The method according to the preferred embodiment, and in particular using the SWEET compression technique, 
satisfies a number of concurrent requirements for storing and scaling thumbnails and therefore has a number of ad- 
vantages. Firstly, the compression of the thumbnails allows as many thumbnails as possible to be stored in memory. 
This is particularly important where large numbers of thumbnails may be loaded into memory in an application where 
a user is for example browsing a large database of images. More preferably, a cache may be used to store thumbnails 
for rapid access, including multiple, different-sized decoded versions of thumbnails. Secondly, the preferred embodi- 
ment makes it possible to decompress thumbnails quickly This provides rapid response time for users, especially in 
applications involving navigation of a database, such as an image database, where a user may zoom a thumbnail up 
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or down. The method according to the preferred embodiment provides compression efficiency and decompression 
speed. Thirdly, the preferred embodiment provides satisfactoiy image quality at all sizes or scales at which thumbnails 
may be displayed. This third requirement Is achieved In the preferred embodiment without requiring multiple compres- 
sions of the thumbnails stored at varying sizes as might be required by JPEG compression, or having a single com- 
pression of an image at the largest possible size needed and then scaling down uniformly Thus, the preferred embod- 
iment does not wastefully use up more storage and memoiy, and is not stow. 

Compressing an image to produce a hierarchical representatton using the SWEET compression technique has a 
number of advantages. The compression achieved by SWEET is typically 10:1 and therefore provides a good com- 
pression factor. Further, decompression using SWEET is fast and linear in relation to the number of pixels decom- 
pressed Thus, the decompression time for displaying a large number (eg, a screen full) of small thuiT*nalls is of the 

sameorderasdisplayingafewlargeslzethumbnails. Further. SWEEThierarchically encodes thumbnails, and therefore 
It 18 not necessary to decompress the whole data set to get at the smaller size thumbnails. In fact, because the SWEET 
compression technique sequentially codes data, the first portion of SWEET compressed data contains all the Informa- 
tion for decompressing the smallest size Image, the next portion contains additional infonnation required for decom- 
pressing the next larger size, and so on. Still further, performance is further enhanced because in the case of displaying 
a number of the smallest size thumbnails, all the data can be prefetched by the CPU cache 
ei*,T^x ^ ®' "^^ highest available resolution stored by SWEET is made. In particular 

SWEET provides a "minus 1" decompression, which provides a thumbnail which is twice the size (in each linear di- 
mension) of the uncompressed thumbnail. When the minus 1 image is scaled down, satisfactory image quality is still 
provided. If a thumbnail is desired for viewing at a size that Is larger than the maximum size provided by SWEET (which 
IS a tunable parameter), the "minus 1" version can be scaled up. A further advantage of the preferred embodiment Is 
that system can handle thumbnails whose uncompressed fonms have different sizes. 

As will be understood from the foregoing the, method according to the preferred embodiments permits thumbnails 
or images to be displayed in various different arrangements that require, at least, one or more of the following- one or 
more different images; or one or more images at different uniform scales; or one or more images In different positions 
The method also facilitates changing between different display arrangements quickly. Images can be displayed with 
satisfactoryquality regardless of the size. 

Fig, 20 illustrates In flowchart form how the various embodiments described above may be practically implemented 
in the computer system 900. At step 181 the aspect ratio of the display 904 is identified from predetermined hardware 
parameters. At step 182, the containment area within the browser applicatfon software Is adjusted in shape to have a 
corresponding aspect ratio as described with reference to Figs. 8 and 13. At step 183 the number of thumbnails are 
Identified so as to permit detemiinatton of the thumbnail size within the containment area. At step 184 the thumbnail 
size is adjusted according to Figs. 4-7B. 16 or 18-1 9 so that all thumbnails fit within the containment area At step 185 
the re-sized thumbnails are arranged in the containment area and finally at step 156 the user may re-size the contain- 
ment area to facilitate ease of browsing. 

Fig. 21 shows how the above appears at a user level. Steps 187A - 187C each identify directory movements that 
may be instigated by the user in a conventional manner. When the correct directory is identified at step 187B at step 
188 A the user views the directory to identify a target image. Such an image is selected at step 188B which causes a 
fast zoom-in as described above. The user at step 188C views an intermediate sized image to confinn the correct 
image^lf not, a zoom-out is perfonned and another selection made at step 188A. If the correct image is identified at 
step 188C. that may be selected to cause the tme Image to be decompressed and displayed at a high quality level In 
each of the directory operations 187A-187C. the aspect ratio based manipulatton of directories and containment areas 
may be perfomied. which in the preferred embodiment is combined with hierarchical representation of the image and 
Its zooming. " 

SWEET - A Method For Digital Image Compression 

This disclosure relates to a method and apparatus for representing digital image data, and in particular to a method 
and apparatus for encoding and decoding transfomi coefficients obtained from digital Image data 

A number of transfomri-based image coding techniques are known which Involve linear transforming a source 
image to decorrelate data and then encoding the transform coefficients. Such conventional techniques include the 
JPEG standard image compression method, wrtilch employs an 8x8 bkx:k discrete cosine transform (DCT) JPEG 
encoding involves transforming blocks of a source image using the DCT quantising the resultant transform coefficients 
where most of the compression is effected taking advantage of visual perception, and lossless encoding the quantised 
coefficients in a predefined zig-zag sequence from towest frequency coefficients to highest frequency coefficients 

There is also a compression technique termed the embedded zerotree wavelet (EZW) method EZW involves 
appfying a discrete wavelet transform to a source Image to decompose the image into a number of high frequency 
subbands and a lowest frequency subband, nomially at a number of different resolution levels or scales Zero tree 
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encoding is then applied to the subbands dependent upon predictbns of the self-similarity of coefficients across scales 
The zero-tree^encoded coefficients are then lossless encoded using arithmetic coding. 

However, both techniques utilise complex methods for encoding position information and employ lossless encod- 
ing Thus, the foregoing methods have a number of disadvantages including lack of flexibility and complexitv in the 
coding technique. r / 

In essence, the SWEET process of the present disclosure provides a method of representing a digital imaqe to 
provide a coded representation, the method comprising the steps of: 

transforming the digital image to derive a plurality of coefficients, each coefficient represented by a predefined bit 
sequence; 

selecting a portion of the plurality of coefficients as a region; and 

(a) seaming the significance of each bitplane of the select^ region from a most significant bitplane towards a 
least significant bitplane and providing a first token in the coded representation for each insignificant bitplane until 
a significant bitplane is determined, wherein a second token is provided in the coded representatfon for the sianif- 
icant bitplane; ^ 

(b) partitioning the selected region into two or nrwre subregions having a predetermined form, and setting each of 
the subregions as the selected region; and 

(c) repeating steps (a) and (b) commencing from the significant bitplane until the selected region has a predeter- 
mined size, wherein the coefficients of the selected region are coded and provided in the coded representation. 

Preferably, the linear transfomning step comprises applying a discrete wavelet transform to the digital image and 
the selected portion of the coefficients initially comprises the entire plurality of coefficients, or a subband of the coeffi- 
cients. 

Preferably, the first and second tokens comprise bit values of 0 and 1. respectively. 
Preferably, the subregions are equally sized, and optkjnally are square. 

Preferably, the predetermined size of the subreglon is a 1 x 1 coefncient. Still further, the 1 x 1 coefficient is encoded 
by outputting bits of the corresponding bit sequence beginning with the respective significant bitplane. Preferably only 
bits of tt^e corresponding bit sequence above a predetemiined minimum bit level are output In the coded represent^tkxi 

Preferabhj m step (c). steps (a) and (b) are repeated otherwise until each bitplane of the selected region has been 
scanned. Each bitplane of the selected regton above a minimum bit level is scanned. 

SWEET also provides a method of encoding a digital image, the method comprising the steps of: 

decomposing the image using a discrete wavelet transform to provide a number of subbands- 

for each subband. selecting the subband as an initial region and perfoiming the folkswing sub-steps- 

(a) checking rf a current bit level of the selected region is significant; 

(b) if the current bit level is significant, outputting a first token in a coded representation and partitioning the selected 
region into a number of equally sized sub-regbns. wherein each sub-regbn is processed as the selected reqion 

in turn; ^ 

(c) if the current bit level is insignificant, outputting a second token in the coded representatbn and selecting the 
next tawer bit level of the selected regfon as the current bit level; 

(d) repeating steps (a) to (c) until the current bit level is less than a specified minimum bit level, or the selected 
region has a predetemiined size and coefficients of the selected region are coded in the coded representation. 

Preferably the coefficients of the selected region are coded by representing each coefficient by the bits between 
the corresponding current bit level and the minimum bit level. 

SWEET qjigprovides a method of encoding a digital image, the method comprising the steps of: 

a) dividing the digital image into a plurality of bbcks; 

b) applying a subband transform to each of the blocks to provide a plurality of AC subband regbns. at one or more 
resolutions, and a DC subband region for each block; 

c) selection the DC subband region as a selected region and performing the following sub-steps- 

ca) checking if a current bitplane of the selected region is significant; 

cb) ifj^rrent bitplane is significant, outputting a first token in a coded representation and partitioning the selected 
regioiTWo a number of sub-regtons, wherein each sub-region is processed as the selected region in tum- 
cOif the current bitplane is insignificant, outputting a second token in the coded representation and selecting the 
next lower brtplane of the selected region as the current bitplane; 

cd) repeating sub-steps ca) tocc) until the current bitplane is less than a specified minimum bitplane, or the selected 
region has a. predetermined size and coefficients of the selected region are coded in the coded representation- 



15 



EP 0 859 335 A2 



d) selecting substantially all unccxJed AC subband regions as a remaining region of each block and scanning the 
significance of each bitplane of the remaining region from a most significant bitplane towards a least significant 
bitplane, and outputting the second token for each insignificant bitplane until a significant bitplane is determined; 

e) setting one or more AC subband regions, of a current resolution level, as a selected region and performing the 
sub-steps ca) to cd); 

f) repeating step e) until substantially all AC subbands of the current resolution level have been encoded; and 

g) repeating steps d) to f) until all AC subbands of each bk)ck have been encoded. 

Other aspects of SWEET include a method for decoding the coded representation of a digital image, an apparatus 
for representing a digital Image to provide the coded representation, an apparatus for decoding the coded represen- 
tation of a digital image, a computer software system for representing a digital image to provide the coded represen- 
tatbn. and a computer software system for decoding the coded representation, in which the coded representation Is 
produced in accordance with the first or second aspect of the invention. 

A high-level block diagram is illustrated in Fig. 22 to provide an overview of the embodiments of the SWEET 
methods. An input Image 1 90 is provided to the transform block 1 92. which is preferably a linear transform, to produce 
corresponding transform coefficients 194. A discrete wavelet transform (DWT) is preferably employed. 

The two-dimensk>nal DWT of an image Is a transfomi that represents the image using a low frequency approxi- 
mation to the image and three high frequency detail components. Conventionally, these components are termed sub- 
bands. Each of the four sub-images formed by the DWT is one quarter of the size of the original image. The low 
frequency image contains most of the information about the original Image. This Information, or energy compaction, 
is the feature of the discrete wavelet transform innage subbands that Is exploited for Image compression. 

The single-level DWT can be applied recursively to the low frequency Image, or subband. an artjitrary number of 
times. For example, a three-level DWT of the image is obtained by applying the transform once and then applying the 
DWT to the low subband resulting from the transformation. Thus, this results in 9 detailed subbands and one (very) 
low frequency subband. Even after three levels of DWTs. the resulting low frequency subband still contains a significant 
amount of infomnation of the original image, yet is 64 times smaller (V4 x V4 x V4), thereby effecting a factor of 64 in 
compression. 

However, other linear transformations for decorrelating image data may be practiced without departing from the 
scope of the invention. For example, a discrete cosine transfomi (DCT) can be practiced. The transform coefficients 
194, or more specifically the bit sequences representing their values, are then coded by the bit rearrangement block 
1 96 in an efficient fashion to provide the coded representation 198. 

The decoding process is simply the reverse of this encoding process. The encoded coefficients are decoded Into 
the transform coefficients. The (transform domain) image is then inverse transformed to form the original image, or 
some approximation thereof . 

SWEET is preferably be practiced using a conventional general-purpose computer, such as that shown In Figs. 1 , 
2 and 3, wherein the processes of Figs. 24 to 27 or Figs. 29 to 32 are implemented as software executing on the 
computer In particular, the steps of the coding and/or decoding methods are effected by instructions in the software 
that are carried out by the computer system 900. 

Before proceeding with a further description of the SWEET implementation, a brief review of terminology used 
hereinafter is provided. For a binary integer representation of a number, "bit n^or "bit number n" refers to the binary 
digit n places to the left of the least significant bit. For example, assuming an 8-bit binary representation, the decimal 
number 9 is represented as 00001001. In this number, bit 3 is equal to 1, while bits 2, 1. and 0 are equal to 0. 0, and 
1, respectively. 

For transform coding applications, the number of bits per coefficient required to represent the possible range of 
coefficients is determined by the linear transform and the resolution of each pixel (in bits per pixel) in the input image. 
This range of values for each pixel is typically large relative to the values of most of the transform coefficients, and 
thus many coefficients have a large number of leading zeros. For example, the number 9 has four leading zeros in a 
8-bit representation and has 12 leading zeros for a 16-bit representation. The embodiment of the invention provides a 
method and apparatus for representing (or coding) these leading zeros, for blocks of coefficients, in an efficient manner 
The remaining bits and sign of the number are encoded directly without modification. 

To simplify the description and not to obscure unnecessarily the invention, the transform coefficients are assumed 
hereinafter to be represented in an unsigned binary integer form, with a single sign bit. That is, the decimal numbers 
-9 and 9 are represented with the same bit sequence, namely 1001, with the former having a sign bit equal to 1 to 
indicate a negative value, and the latter having a sign bit equal to 0 to Indicate a positive value. The number of leading 
zeros is determined by the range of the transform coefficients. In using an Integer representation, the coefficients are 
implicitly already quantised to the nearest integer value, although this Is not necessary for this embodiment of the 
invention. Further, for the purpose of compression, any information contained in fractional bits is normally ignored. 

A region consists of a set of contiguous image coefficients. The term coefficient is used hereinafter interchangeably 



16 



EP 0859 335 A2 



with pixel, however, as will be well understood by a person skilled in the art, the former is typically used to refer to 
pixels in a transform domain (eg., a DWT domain). 

Encoding Process of SWEET 

Fig. 24 is a flow diagram illustrating the image encoding method according to the preferred embodiment. In step 
302. processing commences using an input image. In step 304, the input image is transformed using a linear transfor- 
mation, preferably a discrete wavelet transform. An initial region is defined to be the whole image. For example, in the 
case of a three-level DWT of the input image, the resulting coefficients consisting of the 10 subbands can be specified 
as the region. Alternatively each subband can be processed separately, setting each Initial region to the whole subband 
in question. 

In step 306, the most significant bit (msb) erf the largest absolute value of the transform coefficients is determined 
and a parameter, maxBitNumber, is set to this coefficient value. For example, if the largest transform coefficient has a 
binary value of 00001001 (decimal 9), the parameter maxBitNumber ls set to 3, since the msb Is bit number 3. Alter- 
natively, the parameter maxBitNumber may be set to be any value that is larger that the msb of the largest absolute 
value of the transform coefficients. 

Further, In step 306, a coding parameter, minBitNumbehs set to specify the coded image quality In particular, this 
coding parameter specifies the precision of every coefficient in the transformed image and can be varied as required. 
For example, a minBitNumberoi 3 provides a coarser reproduction of the original image than does a value of 1. 

Optionally, the technique involves step 308 which provides an output header in the coded representation of the 
Input image. Thus, in a practical implementation, header information is output as part of the coded representation. For 
example, the output header of the embodiment of the invention may contain Infonmation about the source image, 
Including the image height and width, the number of levels of the DWT the mean value of the DC subband, the /nax- 
S/YA/u/7?6er parameter, and the minBitNumber parameter. 

Beginning in step 310, each subband of the transformed image is coded separately in steps 312 and 314. Each 
subband is coded independently, in order from low frequency to high frequency For the DC subband. the mean value 
is removed prior to coding and coded into the header infomnatlon in step 308. In step 312, each subband is coded by 
setting an initial region as the whole subband. In step 314, the region Is encoded with the maxBitNumber and minBit- 
Number as parameters. This provides a hierarchal code, since lower resolution versions of the image are coded Into 
the bit stream before higher resolutions. Processing tennlnates in step 316. 

Fig. 25 is a detailed flow diagram of the procedure "Code Tegion{currentBltNumben minBitNumberjr called in step 
31 4 of Fig. 24 for coding each region, where maxBitNumber \s provided as the currentBitNumber, In step 402, process- 
ing commences. The Inputs to the region coding process of Fig. 25 Include the currentBitNumber and minBitNumber 
parameters. Preferably, the method is implemented as a recursive technique where the process is able to call itself 
with a selected region or sub-regbn. However, the process may implemented in a non-recursive manner without de- 
parting from the scope and spirit of the invention. 

In decision block 404, a check Is made to determine if the currentBitNumber parameter Is less than the minBit- 
Number parameter. Othenwise, If decision block 404 returns true (yes), nothing Is done and processing returns to the 
calling procedure In step 406. This condition indrcates that every coefficient In the selected region has a msb number 
less than mInBitNumben If decision block 404 returns false (no), processing continues at decision block 408, 

In decision block 408, a check Is made to determine If the selected region Is a 1 x 1 pixel. If decision block 408 
returns true (yes), processing continues at step 410. In step 410. the 1 x 1 pixel is coded. Preferably, this involves 
directly outputting the remaining bits above the minBitNumber \n the coded representation. In step 412, processing 
returns to the calling procedure. Othenwise, if deciskxi block 408 returns false (no), the regbn consists of more than 
one coefficient and processing continues at decision block 414. 

In decision block 414, the selected region Is checked to determine If it is significant. That Is, the significance of the 
region Is tested. The region is said to be insignificant if the msb number of each coefficient in the region is less than 
the value of the currentBitNumber parameXer. To make the concept of region significance precise, a mathematical 
definition is given in Equation (1 ). At a given bit number, say currentBitNumber=r\, the region is said to be insignificant if: 

|cj <2-. V/.y G fl. (1) 

where R denotes the region, and c^- denotes coefficient (i,j) in this region. 

If decision block 414 returns false (no), processing continues at step 416. In step 416, a value of 0 (or first token) 
Is output in the coded representation stream, and the cun-entBltNumber parameter is decremented by 1. That is, the 
next, lower bitplane of the region is selected for processing. Processing then continues at decision block 404. where 
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the region is again processed with the parameters currentBitNumber- 1 and minBitNumber. aherwise, if decision block 
414 returns true (yes), that is, the region is significant, processing continues at step 418. 

in step 418, a value of 1 (or second token) is output In the coded representation stream. In step 420, the selected 
region Is partitioned into a predetermined number (preferably, 4) of subregions using a specified partitioning algorithm. 
The partitioning algorithm used is known to the decoder. 

In this embodiment of the invention, square regions are used. A region is partitioned preferably into 4 equal-sized 
(square) subregions. As shown in Fig. 23, the selected region (R) 200 has a size of MxM coefficients and is partitioned 
into four equal-sized subregions 210, 212, 214 and 216. Each of the subregions has a size of NxN, where N is equal 
to M/2, Thrs is not always possible depending on the size and shape of the initial region. If this is not possible, the 
initial region can be partitioned into a number of square regions, each having dimensions that are a power of 2.'and 
encode these partitions separately. In any case, this initialisation has minimal effect on the overall results if done In an 
intelligent fashion. In an alternate embodiment, a different partitbn may be used that Is suitable for a block-based coder. 

In step 422. each subregbn is then coded with the same currentBitNumber an6 minBitNumber parameters. This 
IS preferably done by means of a recursive call to the procedure "Code re9ion(currentBitNumben minBitNumber)" of 
Fig. 25. This coding of subregions may be implemented in parallel or sequentially. In the latter case, the processing 
may commence from a low frequency subband to higher frequency subbands in turn. 

In the coded representation, a transform coefficient Is coded by simply outputting the pixel bits from the current- 
BitNumberXo the minBitNumber Preferably, a convention is followed whereby the sign is output only if some of the 
coefficient bits were non-zero. For example, if currentBitNumber = 3, minBitNumber = 1 . then -9 (00001001) is coded 
as "1 0 0" followed by a sign bit "1 

Decoding Process of Preferred Embodiment 

Fig. 26 is a flow diagram illustrating a method of decoding the coded representation of an image obtained using 
the process of Figs. 24 and 25. In step 502. processing commences using the coded representation. In step 504. the 
header informatton is read from the coded representation to determine the size of the original image, and hence the 
initial region size. Also, infomiatbn such as maxBitNumber {equa\ to the initial currentBitNumber m the coding process) 
and mmB/tNumber are input. Further information includes the mean value of the DC subband. 

In step 506. decoding of each subband is commenced by setting the region to the respective subbands in turn In 
step 508, the selected region is decoded using the maxBitNumber ar)6 minBitNumber pararr^eXers. In step 510 the 
inverse DWT is applied to the decoded selected region. Processing terminates in step 512. 

Fig. 27 is a detailed flow diagram of step 508 of Fig. 26 for decoding each region using procedure call "Decode 
reg\on(cunrentBitNumben minBitNumbei)\ where maxBitNumber \s provided as the currentBitNumber In step 602 
processing commences. The inputs to the region decoding process of Fig. 27 are the currentBitNumber ar^6 minBit- 
Number parameters. Again, the method is preferably implemented as a recursive technique. However, the process 
may be implemented in a non-recursive manner without departing from the scope and spirit of the invention. 

In decision block 604, a check Is made to determine if the cunrentBitNumber is less than the minBitNumber If 
decision block 604 retums true (yes), processing continues at step 606, where processing returns to the calling pro- 
cedure. Othenwise, if decision block 604 retums false (no), processing continues at decision block 608. 

In decision block 608. a check is made to determine if the selected region has a size of 1 x 1 pixels If decision 
block 608 retums true (yes), processing continues at step 610. In step 610. the 1 x 1 region is decoded Processing 
then retums to the calling procedure in step 61 2. If decision block 608 retums false (no), processing continues at step 
614. In step 614, a bit Is input from the coded representation. 

In decision block 61 6, a check is made to detemnine if the bit is equal to 1 . that is. the input is checked to determine 
If the region is significant. If decision block 616 retums false (no), processing continues at step 618. In step 618, the 
currentBitNumber is decremented, and processing continues at decision block 604. Otherwise, If decision block 616 
retums true (yes), processing continues at step 620. 1 n step 620. the region is partitioned into the predetermined number 
(preferably, 4) of sub-regions. In step 622. each of the sub-regions is decoded using the currentBitNumber au6 min- 
BitNumber In the preferred embodiment, this is carried out by means of a recursive call to the process illustrated in 
Fig. 27. In step 624, processing retums to the calling procedure. 

Thus, the bits output from the significance decisions in the encoder instruct the decoder on which path of the 
algorithm to take, thus mimicking the encoder. The pixels, and possible sign, are decoded by simply reading in the 
appropriate number of bits {currentBitNumber to minBitNumber aryd if some of these are non-zero the sign bit). 

Two-Dimensional Example 

The method effectively codes the leading zeros of most transform coefficients, while coding the bits from the most 
significant bit to the predetermined least significant bit. specified by the parameter minBitNumber, and the sign simply 
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as is. Thus, the preferred embodiment of the present invention advantageously represents the leading zeros. This 
method is very efficient In certain situations, namely for coding discrete wavelet transform image coefficients, which 
typically exhibit a large dynamic range. A few coefficients typically have very large values, while most have very small 
values. 

An example of encoding a two-dimensional region comprising 4x4 coefficients is described with reference to Figs. 
28A to 28D. The processing of the 4 x 4 region 700 of Fig. 28A is commenced with the maxBitNumber set to 7 since 
this is the largest bit number (bitplane) of all of the coefficients: 
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The minBitNumbens set to 3, for illustrative purposes. A header is preferably output in the coded representation 
containing the maxBitNumber and minBitNumben The process of coding the regbn 700 then follows. 

At currentBitNumber = 7, a one (1) is output since the region 700 is significant with respect to bit number 7 (see 
decision block 404, 408. and 414 and step 418 of Fig. 25). The region 700 is then partitioned into four sub-regions 
(see step 420 of Fig. 25): the top left region 710. the top right region 712, the bottom left region 714 and the bottom 
right region 716 of Fig. 28A. Each of the subregions consist of 2 x 2 coefficients. 

The sub-regions 710. 712, 714 and 716 of Fig. 28A are In turn coded in the predefined processing sequence shown 
of Fig. 28B. where a region 750 consists of four sub-regions 750A to 750D. The three arrows illustrated in the diagram 
indicate the order or sequence of processing, that is, top left sub-region 750A, top right sub-region 75013, bottom left 
sub-region 750C and bottom right sub-region 750D, respectively. 

The sub-region 710 of Fig. 28A is coded first (see step 422 of Fig. 4). For the currentBitNumber equa\ to 7, a one 
(1) IS output in the coded representation. The sub-region 710 is then partitbned into four 1 x 1 pixels having decimal 
values 200. 13, -13 and 3. Each of these coefficients is coded by outputting the bits of each coefficient from the cur- 
rentBitNumber = 7X0 the minBitNumber = 3 (see decision block 408 and step 41 0 of Fig. 28). A sign bit is then output 
If required. Thus, the decimal value is 200 is coded as 11001 followed by the sign bit 0. The coefficient value 13 is 
coded as 00001 with a sign bit 0. The coefficient value -1 3 is coded as 00001 with a sign bit 1 . Finally, the coefficient 
value 3 is coded as 00000 (without a sign bit). The coded representation of each coefficient includes the two "1" bits 
preceding the bits of coefficients -200" between the currentBitNumber and minBitNumben This completes the coding 
of the top left sub-region 710. The coded output at this state is: 



sijin hit 
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The header information is not shown in the foregoing expression. 

The top right sub-region 712 is then coded (per Fig. 28B). A zero (0) is output for each of currentBitNumber equa\ 
to 7, 6. 5, and 4, since the region 71 2 is insignificant with respect to these bit numbers. A one (1 ) is output at current- 
BitNumber^ 3, since this bitplane is significant with respect to bit number 3. The sub-region 71 2 Is partitioned into the 
four 1 x 1 pixels having values -11, -8, -4 and -3. These decimal values are coded as bit value 1 with sign bit 1, bit 
value 1 with sign bit 1 and bit values 0 and 0 without sign bits, respectively. Thus, at this stage, the coded representation 
is as follows: 



11 1100 100000 1 00000 110000000001 n 11^ 0 

-Ti^^^ 

The bottom left sub-region 714 is then encoded. A zero (0) is output for each of currentBitNumber equal to 7. 6. 
5, and 4, since the region 714 is insignificant with respect to these bit numbers. A one (1) is output a\ currentBitNumber 
equal to 3, since this bitplane is significant with respect to bit number 3. The sub-region 714 is then partitioned into 
four 1 X 1 pixels having values 8. 1 , 2 and -1 . These are coded respectively as binary value 1 with sign bit 0, and binary 
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values 0,0 and 0 without sign bits. 

Finally, the bottom right sub-region 716 having values -2, -2. -3, and -3 Is coded. A zero (0) Is output for each of 
currentBitNumber=7, 6. 5, 4 and 3 since the sub-region 716 is insignificant with respect to these bit numbers No sign 
bits are output. Thus, the coded representation is as follows: 



1 1 1 1001000001000001 10000000001 1 1 1 100000011000000000. 



depiltedfn pJycT'''^ encoding process to reconstruct the region from the coded representation as 

The decoding process can be made "smarter- In a number of ways. One such a "smarter" way is depicted In Fig 
28D. In this case, the magnitude of the non-zero coefficients is each increased by half of 2 to the power of minBitNumber. 
This IS depicted in Fig. 28D. In this manner, the "smart" decoding processing generally reduces the mean square error 
between the decoded and the original coefficients. Still further, the encoder can alternatively perform this (type of) 
operation, thereby leaving the decoder to use the simplest depicted In Fig. 28C. 

Encoding Process of Alternate SWEET 

A coding process according to an alternate SWEET method Is hereinafter described with reference to Figs 29 to 
32. The processes illustrated in the flow diagrams of Figs. 29 to 32 may be Implemented using software executing on 
a general-purpose computer 900 corresponding to that described. 

A discrete wavelet transfomi of an entire digital image can be pertomied on a block-by-block basis The result of 
the transformation upon each block is a set of coefficients, which are essentially equivalent to a set of spatially corre- 
sponding coefficients of a discrete wavelet transform of the entire Image. For example, from a predetermined set of 
coefficients of a DWTfor an entire image, a portk>n or block of the digital image can be reproduced to a specified detail 
Selecting the predetermined set of coefficients from the frequency domain amounts substantially to representing the 
corresponding portion of a digital image (the bk)ck) from the spatial domain. A block based DWT of a digital Image can 
be performed by decomposing an image into a plurality of blocks and applying the transform to each block independ- 
ently, thereby substantially evaluating those DWT coefficients relevant to the current spatial locatten The advantage 
of adopting a block-based transfomi approach is that a block can be subsequently encoded with minimal interaction 
(substantially independent) from another block of the image. Block-based techniques are inherently memonr localised 
and therefore are generally efficient when implemented using computer systems. 

Fig. 29 is a flow diagram illustrating the block-based encoding process according to the altemate embodiment of 
the invention. Processing commences at step 902. In step 904, a header is output. This infomiation pref erabV includes 
the image height and width, the block size, the number of levels of the DWT, and two coding parameters maxBilNumber 
and minBitNumber. Optionally, more or less header information may be used depending upon the applkatlon 

The coding parameter maxBitNumber can be selected in a variety of ways. If the block DWT Is performed on all 
image blocks prior to coding of any of them, the maxBitNumber can be chosen to be the MSB number of the largest 
coefficient across all DWT bkx:ks. For example, if the largest coefficient Is 10000001 (decimal value 129) the m^Bit- 
Numbens set to 7 since the MSB is bit number 7. Alternatively, a deterministic bound can be used which is detemiined 
by the transform and the resolution of the Input image. For example, with an 8-bit input image (level shifted to 7-bits 
plus sign) and the Haar transform, the largest MSB is bounded by J+7where Jis the number of levels of the DWT. If 
the blocks are small, the selection of this parameter can have a significant effect on compression. In some Instances 
SiS of selecting maxBitNumber may be employed. However, this depends upon the specific ap- 

The parameter /n/nerfA/i/m6©/-detemr)ines the compression ratio versus quality trade off and can be varied For 
example, for nearly orthogonal transfomis, a value of 3 provides adequate Image quality for 8-bit, grey-scale or 24-bit 

RGB images. * 

In step 906, the Image is decomposed into blocks (or an Image block is formed). The image is decomposed pref- 
erably into overlapping blocks. However, non-overlapping blocks may be employed. The block of coefficients can be 
as large as the whole original image, or as small as a block of 8x8 coefficients (for a three-level transform) For tow 
memory applications, a block that is as small as possible may be employed. Generally a block size of 16 coefficients 
IS sufficient for higher levels of compression with a three or four level DWT A block size of 8x8 coefficients with a three- 
level DWT can maintain good coding efficiency by employing differential pulse code modulation (DPCM) on the DC 
coefficient of each block. " 

In step 908, each block is level shifted and the transform is perfomied. Preferably, a DWT is employed The imaqe 
values are level shifted (for example, by 1 28 for an 8-bit image) to reduce or eliminate any undue mean bias, and each 
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spatial block of the Image Is transformed. For a DWT, usually some knowledge of the block surrounding the current 
block IS needed (and similarly for the inverse DWT), although this is not strictly required. 

In step 910, the block is coded using the maxBitNumberand minBitNumber parameters. Pixjcessing terminates 
in step 912. 

r r-®*^^^ ° '^'^^ ^ illustrated in detail in the flow diagram of Fig. 30. Inputs to the block coding process 

of Fig. 30 include the currentBitNumber and the m/nB/tA/umfter parameters. With reference to step 910 of Fig 29 the 
maxBitNumber ss input as the ci/rrenfS/YA/umtof parameter. Processing commences in step 1002 In decision btock 
1004, a check is made to determine if the currBntBitNumber is less than the minBitNumber. If decision block 1004 
returns true (yes), processing continues at step 1006. In step 1006. execution returns to the calling process, thereby 
indicating that every coefficient in the block has an MSB number less than the minBitNumber. Otherwise, if deciston 
block 1004 returns false (no), processing continues at decision block 1008. 

in deci^n block 1008. a check is made to detemiine if a current block Is significant. If decision block 1 008 returns 
false (no), processing continues at step 1 010. In step 1 010, a zero (0) is output in the coded representation and the 
currBntBnNumber is decremented, that is, the next lower bit plane is selected. Processing then continues at deciston 
block 1004. Otherwise, if decision block 1 008 returns true (yes) processing continues at step 1012 

Decision blocks 1004 and 1008 along with step 1010 enable the process to find the MSB number of the largest 
coefficient in the bkjck. A bkjck is insignificant with respect to the cunentBitNumber n the MSB number of every coef- 
ficient in the block is less than the currentBitNumber. This is repeated until the bitplane of the block is significant or the 
currentBitNumber \s less than the minBitNumber 

In step 1012, a one (1) is output in the coded representatkxi to Indicate the bitplane is signiffcant In step 1014 
the DC subband is coded. In step 1016. the bk)ck detail is coded using the parameters J. currentBitNumber and min- 
BitNumber In step 1018. execution returns to the calling procedure. Thus, given that the block is significant steps 
1012, 1014 and 1016 are carried out to use the (generalised) quadtree segmentation to find all coefHcients with an 
MSB number greater than the minBitNumber If the block is significant, it is partitioned into two "sub-blocks- the DC 
subband coefficients and the block consisting of the remaining coefficients, referred to as the "block detail" for level J 
since It represents the high frequency information about the block of level J at all lower levels 

Step 101 4 of Fig. 30 for codhg the DC subband is illustrated in detail by the flow diagram of Fig 32 That is Fig 
32 shows the process of coding a subband or sub-block using currentBitNumber ar\d minBitNumber parameters In 
step 1202, processing commences. In decision block 1204. a check is made to determine if the cun-entBitNumber is 
less than the mmBnNumber. If decision block 1204 returns trae (yes), processing continues at step 1 206 In step 1 206 
execution returns to the calling procedure. Otherwise, if decision block 1 204 returns false (no), processing continues 
at decision block 1208. 

In deciston block 1208 a check is made to determine if the (subband) block size is 1 x 1 pixels. If deciston block 
1208 returns true (yes), processing continues at step 1210. Instep 1210. the 1 x 1 pixel is coded. This involves outputting 
the bits between the cunentBitNumberar\dVr\e minBitNumber. inclusive, followed by a sign bit if necessary. Processing 
then retums to the calling procedure in step 1212. Otherarise. if decision block 1208 returns false (no) processing 
continues at decision block 1214. » /• r » 

In deciston block 1214, a check is made to determine if the (subband) block is significant. If decision btock 1214 
retums false (no), processing continues at step 1216. In step 1216, a zero (0) is output in the coded representatton 
and the currentBitNumber is decremented. Processing then continues at deciston block 1204. Otherwise, if decision 
block 1214 returns true (yes), processing continues at step 1218, 

In step 1218, a one (1) is output in the coded representation to indicate that the (subband) block is significant In 
step 1220, the (subband) block is partitioned into four sub-blocks. In step 1222. each sub-block is coded using the 
parameters currentBitNumberatvi minBitNumber. by means of a recursive call to the process of Fig 32 In step 1224 
execution retums the calling procedure. 

Thus, in Hi^rocess of Fig. 32. a subband or sub-block thereof is coded. The largest MSB number is isolated as 
before. If the sub-block consists of only one pixel, it is coded as a single coefficient. Otherwise, the currentBitNumber 
IS decremented and a zero (0) is output in the coded representation until the cunentBitNumber is less than the min- 
BitNumber. or the subband (sub-block) is significant. If the subband (sub-block) is significant, it is partitioned into four 
(as close to equal as possible) sub-block, and these are coded in turn. A single coefficient, for example the DC coef- 
ficient, IS encoded by outputting the coefficient bits from the ci//renfB/fA/um6ertothe minBitNumber. Again the siqn is 
preferably only output if some of the coefficient bits are non-zero. 

Steygj^gLof Fig. 30 for coding block detail Is illustrated by the flow diagram of Fig. 31. In step 1102 processing 
commences. In decision btock 1104, a check is made todetermine if the currentBitNumberis less than the m/nS/f/Vi/m- 
ber If decision block 1104 retums true (yes), execution retums to the calling procedure in step 1106 aherwise if 
decision block 1104 returns false (no), processing continues at decision block 1108. 

Indecisionblock1108,acheckismadetodetermlneif thebIock(detail)issignificant. If decision block IIOBretums 
false (no), processing continues at step 1110. In step llio, a zero (0) is output in the coded representation and the 
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currentBitNumber is decremented. Processing then continues at decision block 1104. aherwise, if decision block 1108 
returns true (yes), processing continues at step 1112. 

i 11 J" ^'^f Vl^' ! °u® '^^'^ representation to indicate that the block (detail) Is significant In step 

1114. each of the high-low (HL), low-high (LH), and high-high (HH) frequency subbands is coded The HL LH and HH 
frequency subbands of each resolution are commonly referred to as AC subbands. Each of these subbaiids is coded 
.n accordance with the process of Fig. 12. In step 1116. the block detail is coded using the parameters J-l, currentBit- 
Nvmberani mmSilNumber (if the block detail exists) by means of a recursive call to the process illustrated in Fig 11 
Execution retumstothecalllngprocedurein step 1118. "-iBumng. n. 

Thus, the block detail for level J is processed to first isolate the MSB number of the largest coefficient This is done 
?hT.r""?.^ c^/rrenfftf/Vi/mfterand outputting zeros until the block is significant. The bkx;k is then partitioned 
into the three high frequency subbands at level J and the block detail for level J-l {it J-l is greater than 0) This 
partitioning approach is motivated by the so called 1/f type spectral models 

HocJSlJ^lf ^"^^^ ^"^'"^'^ implemented by mimfcking the coding process 

described with reference to Figs. 29 to 32. " "«iiia niw-ess 

manne^ in which the representation is suitable for storing and/or transmitting images. The encoding techniques can 
1^°"'°^"* ^ ^"^^ °' coefficients, and to provide an efficient representation by repre- 

senting an image in the discrete wavelet transfom, domain. In particular, the embodiments provide methods and ap- 
para uses for representing (or coding) leading zeros of blocks of transform coefHcients obtained from an input image 
2er noSrr L'TJ T"^ ^ reproduction of the original image for a given size code and 

eS inni °^'"^i 2'' '"'^"'""^ ^"'^'"^ ^ transformation are 

de^m naTul ^ °' T'^ ^''"^ advantageous aspects of the embodiment include the 

fnr^L ? K "^^^ Of «^ing subbands. the advantageous aspects of the invention 

include hierarchical coding of each subband separately. uivBinion 

Summarv 

nf Jl?" T^'^"' ^ number of arrangements are disclosed that provide for the manlpulatfon 

scahng. Such methods find partKular application in image-based file management systems and browsing-type appli- 

trvrufdi^tsztr''^'"^"^^^^^^^ 

.nrtlUllT^^^ "^^^"^u^ ^ """"^^^ °^ embodiments of the present Invention, however, modificattens 



Claims 

1. A method of zooming an image on a display device, said method comprising the steps of: 

displaying a first image at a first predetermined size; 

stretching or shrinking the first image to one or more predetermined image sizes- 
displaying at least one of said stretched or shrunken images; and 
displaying a second image at a second predetermined image size. 

2. A method according to claim 1 , wherein saW first image Is stored on a memory storage means in an encoded 
fomiatandthestepsofdisplayingfurtherincludesadecodlngofthefirstimagefordisplay '"^^^<^^ 

3. A method according to claim 1, wherein said second image is stored on a memory storage means in an encoded 
fonnat and the steps of displaying further includes a decoding of the second image for display 

4. A method according to claim 2 or 3. wherein the encoded foimat is a compressed image format. 

5. A method according to claim 4. wherein the compressed image format is a wavelet coding technique, 
definedr""^ ««^°^di"9 to claim 5, wherein said wavelet coding technique comprises the SWEET method (as herein 
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7. A method according to claim 4 or 5, wherein the compressed image format Is achieved by a JPEG coding 
technique. 

8. A method according to claim 1 , wherein the first and second image are different resolutions of a single Image. 

9. A method according to claim 7, wherein said single image is stored as a single compressed image and wherein 
the first and second Images are extracted from the single compressed image. 

10. A method according to claim 1, wherein said stretching or shrinking comprises adjusting the size of the first 
image by manipulating pixels of the first image. 

11. A method according to claim 10, wherein said stretching comprising mathematically manipulating values of 
groups of pixels to provide at least one pixel value, and replicating said at least one pixel value. 

12. A method according to claim 10, wherein said shrinking comprises mathematically manipulating groups of 
pixels to provide at least one pixel value, and assigning said one pixel vaiue to a limited number of pixels smaller 
that contained in said groups. 

13. A method according to claim 12 wherein, said assigning comprises discarding pixels from said image. 

14. A method according to claim 3, wherein said Image Is coded into said encoded format by a method of repre- 
senting a digital image to provide a coded representation, said method comprising the steps of: 

transforming said digital image to derive a plurality of coefficients, each coefficient represented by a predefined 
bit sequence; 

selecting a portion of said plurality of coefficients as a region; 

(a) scanning the significance of each bitplane of said selected region from a most significant bitplane towards 
a least significant bitplane and providing a first token in said coded representation for each insignificant bitplane 
until a significant bitplane is determined, wherein a second token is provided in said coded representation for 
said significant bitplane; 

(b) partitioning said selected region Into two or more subreglons having a predetermined form, and setting 
each of said subreglons as said selected region; 

(c) repeating steps (a) and (b) commencing from said significant bitplane until said selected region has a 
predetermined size, wherein said coefficients of said selected regk)n are coded and provided in said coded 
representation. 

15. A method according to claim 14, wherein said transforming step comprises applying a discrete wavelet trans- 
form to said digital image. 

16. A method according to claim 14, wherein said regton comprises said entire plurality of coefficients. 

17. A method according to claim 15, wherein said portion comprises a subband of said plurality of coefficients. 

18. A method according to claim 14, wherein said first and second tokens comprise bit values of 0 and 1 , respec- 
tively. 

19. A method according to claim 14, wherein said subreglons are equally sized. 

20. A method according to claim 1 9, wherein said subregions are square. 

21. A method according to claim 14, wherein said predetermined size of said subregion Is a 1 x 1 coefficient. 

22. A method according to claim 21 , wherein said 1 x 1 coefficient is encoded by outputting bits of said correspond- 
ing bit sequence beginning with the respective significant bitplane. 

23. A method according to claim 22, wherein only bits of said corresponding bit sequence above a predetermined 
minimum bit level are output in said coded representation. 
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24. A methcxJ according to claim 14, wherein, in step (c), steps (a) and (b) are repeated otherwise until each 
bitplane of said selected region has been scanned. 

26. A method according to claim 24. wherein each bitplane of said selected region above a minimum bit level has 
been scanned. 

26. A method of zooming an image on a display device, said method comprising the steps of: 

(a) decompressing a compressed true image to a first representation of said image at a predetermined size, 
and displaying said first representation; 

(b) if the predetermined size is smaller than a desired image size then 

(I) stretching the representation to a larger image size and displaying said stretched representation; and 
(ii) repeating step (b)(i) until said desired image size is reached; 

(c) if the predetermined size is larger than a desired image size then 

(i) shrinking the representation to a smaller image size and displaying said shrunk representation; and 
(il) repeating step (c)(i) until said desired image size is reached. 

27. A method according to claim 26 comprising the further step, after step (c), of: 

(d) decompressing the compressed true image to a further representation at a said desired size and displaying 

said further representation. 

27. A method according to claim 26, wherein step (d) occurs substantially simultaneously during steps (a) (b) and 
(c). 

28. Apparatus for zooming an image for display on a display device, said apparatus comprising: 

means for displaying a first representation of sakl Image at a first predetermined size 
means for stretching or shrinking said first representation to at least one further predetermined size; 
means for displaying said stretched or shrunken representation at each said further predetermined size, and 
means for displaying a second representation of said image at a second predetermined size. 

29. Apparatus according to claim 28, wherein means for displaying said second representation operates substan- 
tially simultaneously with the other said means. 

30. A computer program product comprising a computer readable medium having a computer program recorded 
thereon for zooming Images for display on a display device, said computer program product comprising: 

means for displaying a first representation of said image at a first predetermined size 
means for stretching or shrinking said first representation to at least one further predetermined size; 
means for displaying said stretched or shrunken representation at each said further predetermined size, and 
means for displaying a second representation of said image at a second predetermined size. 

31 . A method of zooming thumbnails of digital Images for use in image browsing, said method comprising the steps 
of: 

laying out saki thumbnails in at least one containment area having the same aspect ratio as an available 
display area of a display means; and 

scaling said thumbnails contained in said at least one containment area to fit within the available area of said 
containment area. 

32. A method according to claim 31, further comprising the step of increasing the scale of said thumbnails If in- 
creased detail of said thumbnails Is required for assessment. 

33. A method according to claim 32, wherein said thumbnails are scaled to be as large as possible and fit within 
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said available display area. 

34. A method according to any one of claims 31 to 33, further comprising the step of organising said thumbnails 
Into two or more groups in respective containment areas. 

: 35. A method according to claim 34, wherein said groups have a hierarchical structure. 

36. A method according to claim 34 or 35, wherein one or more groups laid out in respective containment areas 
comprises two or more (sub)groups of thumbnails laid out in respective (sub)conta[nment areas, each (sub)con- 
tainment area having the same aspect ratio as the parent containment area and said displayable area. 

37. A method according to any one of claims 31 to 36, wherein one or more of said digital images is generated 
using a digital image capture device. 

38. A method according to claim 31 wherein said scaling comprises zooming said thumbnails according to the 
methodof claim 1. 

39. Apparatus for zooming thumbnails of digital Images for use in innage browsing, said thumbnails being display- 
able on a display means, said apparatus comprising: 

means for laying out said thumbnails in at least one containment area having the same aspect ratio as an 
available display area of a display means; and 

means for scaling said thumbnails contained in said at least one containment area to fit within the available 
area of said containment area. 

40. Apparatus according to claim 39, further comprising: 

a computer; 

a computer readable medium having recorded thereon a computer program for zooming said digital images, 
said computer program carrying out the method according to any one of claims 31 to 38, wherein said computer 
program can be loaded into and run by said computer to implement said apparatus. 

41 . A computer program product having a computer readable medium having a computer program recorded ther- 
eon for zooming thumbnails of digital images for use in image browsing, said computer program product comprising: 

means for laying out said thumbnails In at least one containment area having the same aspect ratio as an 
available display area of a display means; and 

means for scaling said thumbnails contained in said at least one containment area to fit within the available 
area of said containment area. 

42. A method of scaling thumbnails representing digital Images, said method comprising the step of: 

encoding a digital image using a hierarchical representation to provide a thumbnail. 

43. A method according to claim 42, further comprising the step of decoding said hierarchical representation of 
said digital image at a predetermined one of a plurality of scales to provide said thumbnail having a desired size. 

44. A method according to claim 43, wherein the scale of said decoded hierarchical representation Is equal to or 
greater than the desired size of said thumbnail. 

45. A method according to claim 43. further comprising the step of downsampling the decoded hierarchical repre- 
sentation to provide the deslred-size thumbnail if the scale of said decoded hierarchical representation Is greater 
than the desired size of said thumbnail. 

46. A method according to claim 43, further comprising the step of upsampling the decoded hierarchical represen- 
tation to provide the deslred-size thumbnail if the scale of said decoded hierarchical representation is less than 
the desired size of said thumbnail. 

47. A method according to claim 42, further comprising the step of decoding said hierarchical representation of 
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sjddigtel image attwodiflerentadjacentscalesofaplurality Of sc^^^^ 

48 A mothod according to claim 47, further comprising the step of Interpolating between the two decoded hierar 

ad'LLTcTJr^^^^^^^^ 

Ho.nL^ h ? T ^' ^®P^«««"'3"0"s is greater than the desired thumbnail size and the other of sawLo 
decoded hierarchical representations is less than the desired thumbnail size. 

49. A method according to any one of cbims 43 to 48. further comprising the step of displaying said thumbnail. 

a^rSdSrrtrsf^^^r^a^ir. 

51. A method according to claim 50, wherein the encoding comprises the steps of: 

iirs^urcer''"'''''"'"''''"'^"''''""^^^ 

selecting a portion of said plurality of coefficients as a region- 

irorrsr^ir^^^^^^^ 

Sidr^rH""' ^V"^- '''^ "°'"'"^«"9 fro'" said significant bitplane until said selected region has a 
rep?ien™n """"" """^'"'^ °' ^^''^ ^^^^ ^^^^ ^ P-ided L Z^^ 

ftfto'S'diSanSs^ 

53. A method according to claim 51 . wherein said regbn comprises said entire plurality of coefficients. 

54. A method according to claim 52. wherein sakJ portion compr«es a subband of sakJ pluraltty of coelficients. 
A method according to Claim 51 . Wherein said first and second totens comprise W 

56. A method according to claim 51 . wherein said subregions are equally sized. 

57. A method according to claim 56, wherein said subregions are square. 

68. A method according to claim 51 . wherein saW predetemiined size of said subregk,n is a 1 x 1 coefficient. 

f:-brsrr:*ct:^ 

^^^-^ each 

Sen^ ^nnT'"'"' '"'""^ '''' "^"^'^"^ °' -9'- above a minimum bit level has 

63. A method according to claim 50. wherein said encoding comprises the steps of: 
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decomposing said Image using a subband transform to provide a number of subbands: 

for each subband, selecting said subband as an initial region and performing the following sub-steps: 

(a) checking if a current bit level of the selected region Is significant; 

(b) If said current bit level is significant, outputting a first token In a coded representation and partitioning said 
selected region into a number of equally sized sub-regions, wherein each sub-regbn is processed as said 
selected region in turn; 

(c) If said current bit level is insignificant, outputting a second token in said coded representation and selecting 
the next lower bit level of said selected region as said current bit level; 

(d) repeating steps (a) to (c) until said current bit level is less than a specified minimum bit level, or said selected 
regbn has a predetermined size and coefficients of said selected region are coded In said coded representa- 
tion. 

64. A method according to claim 63, wherein said coefficients of said selected region are coded by representing 
each coefficient by the bits between the corresponding current bit level and the minimum bit level. 

65. A method according to claim 63 or 64, wherein sakl subband transform comprises a discrete wavelet transform. 

66. A method according to claim 50, wherein said encoding comprises the steps of: 

a) dividing the digital image into a plurality of blocks; 

b) applying a subband transform to each of said blocks to provide a plurality of AC subband regions, at one 
or more resolutions, and a DC subband region for each block; 

c) selection said DC subband region as a selected region and perfonming the following sub-steps: 

ca) checking if current bitplane of the selected region Is significant; 

cb) If said current bitplane is significant, outputting a first token in a coded representation and partitioning 
said selected region Into a number of sub-regions, wherein each sub-region is processed as the selected 
region in turn; 

cc) if said current bitplane is insignificant, outputting a second token in the coded representatbn and 
selecting the next lower bitplane of the selected region as the current bitplane; 

cd) repeating sub-steps ca) to cc) until said current bitplane is less than a specified minimum bitplane. or 
said selected region has a predetermined size and coefficients of said selected region are coded in said 
coded representation; 

d) selecting substantially all uncoded AC subband regions as a remaining region of each block and scanning 

the significance of each bitplane of the remaining region from a most significant bitplane towards a least sig- 
nificant bitplane, and outputting the second token for each insignificant bitplane until a significant bitplane is 
determined; 

e) setting one or more AC subband regions, of a current resolution level, as a selected region and perfomiing ' 
the sub-steps ca) to cd); 

f) repeating step c) until substantially all AC subbands of the current resolution level have been encoded; and 

g) repeating steps d) to f) until all AC subbands of each block have been encoded. 

67. A method according to any of claims 43 to 66, further comprising the step of caching said hierarchical repre- 
sentation and/or said decoded hierarchical representation. 

68. A method according to claim 67, further comprising the step of caching intermediate size or scaled versions 
of said decoded hierarchical representation. 

69. Apparatus for scaling thumbnails representing digital images, said apparatus comprising: 

means for storing a thumbnail, said thumbnail comprising a hierarchically encoded representation of a digital 
image; and 

means for decoding said hierarchical representation of said digital image at at least a predetermined one of 
a plurality of scales to provide said thumbnail having a desired size. 

70. Apparatus according to claim 69, wherein the scale of said decoded hierarchical representation is equal to or 
greater than the desired size of said thumbnail. 
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71. Apparatus according to claim 69, further comprising means for downsampiing the decoded hierarchical reo- 
than the destred size of said thumbnail. y'««*ier 

litSrorcJSriS L^ir^'"' "^'""^ "P^^'^P""^ the decoded hierarchical repre- 

thetsrz:s„r 

73 Apparatus according to claim 69. wherein said decoding means decodes said hierarchical representation of 
s^ddigrtal^ageattwodifferentadjacentscalesofapluralftyofscalestop^^^ 

74 Apparatus according to claim 73. further comprising means for interpolating between the two decoded hierar- 
2 i'S^wo d3*^?' having different adjacent scales to provide the desired-L thumbnail "e ^ of o^Ti 
decoded hierarchical representations is less than the desired thumbnail size. 
7S.Apparatusacco,dingtoanyoneof claims 69to74.furthercomprising^^^^^^ 

t?p'.sr.reSre:^^^^^^^^ 

d'^crrieur l^^to^d -m'agT ^"^""^ "^^^"^ '"^'"^^ "^^^^ ^^'^^ ^ 

78. Apparatus according to claim 77. wherein encoding comprises SWEET encoding (as herein defined). 

7a Apparatus according to any of claims 69 to 78, further comprising means for caching said hierarchical reore- 
sentation and/or said decoded hierarchical representation. a n.erarcnicai repre- 

means for caching intermediate size or scaled versions 
of said decoded hierarchical representation. vsrsions 

81. A computer program product comprising a computer readable medium having a computer program recorded 
thereon for scaling thumbnails representing digital images, said computer program product coi^^: 

liage! and ^ ^ hierarchically encoded representation of a digital 

means for decoding said hierarchical representation of said digital image at at least a predetermined one of 
a plurality of scales to provide said thumbnail having a desired size. eoeierminea one of 

82. The computer program product according to claim 81 , wherein the scale of said decoded hierarchical reore- 
sentation is equal to or greater than the desired size of said thumbnail. Hierarchical repre- 
ss. The cornputer program product according to claim 81 , further comprising means for downsampiing the decoded 
hieramhical representation to provide the desired-size thumbnail if the scile of said decoded hSrchfcaf^pre 
sentaton is greater than the desired size of said thumbnail. nierarcnicai repre 

84 The computer program product according to claim 81. further comprising means for upsampling the decoded 
hierarchical representation to provkJe the desiredH size thumbnail if the scale of said dec<^ed hiSrcWcaSr^ 
sentation is less than the desired size of said thumbnail. nierarcnicai repre- 

ss. The computer program product according to claim 81 . wherein said decoding means decodes said hierarchical 

86. The computer program product according to claim 85. further comprising means for interpolating between the 
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two decoded hierarchical representations having different adjacent scales to provide the desired-size thumbnail if 
the scale of one of said two decoded hierarchical representations is greater than the desired thumbnail size and 
the other of said two decoded hierarchical representations is less than the desired thumbnail size. 

87. The computer program product according to any one of claims 81 to 84, further comprising display means for 
displaying said thumbnail. 

88. The computer program product according to any one of claims 81 to 87, further comprising means for encoding 
said digital image to provide said hierarchically encoded representation. 

89. The computer program product according to claim 88, wherein said encoding means Includes means for ap- 
plying aTiierarchical discrete wavelet transform to said Image. 

90. The computer program product according to claim 89, wherein encoding comprises SWEET encoding (as 
herein defined). 

91 . The computer program product according to any of claims 81 to 90, further comprising means for caching said 
hierarchical representation and/or said decoded hierarchical representation. 

92. The computer program product according to claim 91 , further comprising means for caching intermediate size 
or scaled versions erf said decoded hierarchical representation. 

93. An irmge-based computerized file browsing system comprising: 

a storage device for retaining a plurality of Images at least group of which are stored in an encoded format; 
a processor for running an operating system incorporating a file management arrangement wherein true im- 
ages retained in said storage device are depicted as thumbnail representations when viewed via said file ' 
management system, 
characterised by 

an image manipulation arrangement whereby said thumbnail representations are alterable in size and only a 
limited number of size alterations result from decoding from said encoded format, the remainder arising from 
pixel manipulation of intermediate sized images sized between a starting size and an ending size. 

94. A system according to claim 93, wherein said file management arrangement groups certain ones of said rep- 
resentations Into containment areas whereby an aspect ratio of said containment areas corresponds to that of a 
display device upon which said Images and representations are reproduced. 

95. A system according to claim 94, wherein representations within a containment area can be treated as a single 
representation when the display size of the containment area is being altered. 

96. A system according to claim 93, wherein said images are encoded according to a hierarchical coding scheme 
that provides for incremental decoding of different resolutions of an image and said image manipulation arrange- 
ment decodes said thumbnail representations via one of said Increments when perfonning a size alternation. 

97. An image storage and retrieval system comprising: 

image memory means providing access in use to a plurality of image data files; 

display means operable to display thumbnail Images at least partially representative of said Image data files; 
processing means operable to variably control at least one of the size, shape, grouping and Information content 
of the displayed thumbnail images; and 

output means operable to output data from a selected one of the image data files corresponding to a user 
selection of a corresponding thumbnail Image. 
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